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APC ANTIBODIES 

This application is a division, of application Ser. No. 
08/289,548, filed Aug. 12, 1994, which is a division of 
application Ser. No. 07/741,940 filed Aug. 8, 1991 (issued as 5 
U.S. Pat No. 5352,775). 

The U.S. Government has a paid-up license in this 
invention and the right in limited circumstances to require 
the patent owner to license others on reasonable terms as 
provided for by the terms of grants awarded by the National 
Institutes or Health. 

TECHNICAL AREA OF THE INVENTION 

The invention relates to the area of cancer diagnostics and 
therapeutics. More particularly, the invention relates to 
detection of the germline and somatic alterations of wild- 
type APC genes. In addition, it relates to therapeutic inter- 
vention to restore the function of APC gene product 

BACKGROUND OF THE INVENTION 20 

According to the model of Knudson for tumorigenesis 
(Cancer Research, VoL 45, p. 1482, 1985), there are tumor 
suppressor genes in all normal cells which, when they 
become non-functional due to mutation, cause neoplastic 25 
development Evidence for mis model has been found in the 
cases of retinoblastoma and colorectal tumors. The impli- 
cated suppressor genes in those tumors, KB, p53, DCC and 
MCC, were found to be deleted or altered in many cases of 
the tumors studied. (Hansen and Cavenee, Cancer Research, ^ 
VoL 47, pp: 5518-5527 (1987); Baker et aL, Science, VoL 
244, p. 217 (1989); Fearon et aL, Science, VoL 247, p. 49 
(1990); Kinder et aL Science VoL 251. p. 1366 (1991).) 
a ijn order to fully understand the pathogenesis of tumors, it 
fY^.A^ ' will be necessary to identify the other suppressor genes that 35 
play a role in the tumorigenesis process. Prominent among 
these is the one(s) presumptively located at 5q21. Cytoge- 
netic (Herrera et aL, Am /. Med. Genet, VoL 25, p. 473 

(1986) andlinkage (LeppertetaL, Science, VoL238,p. 1411 

(1987) ; Bodmer et aL, Nature, VoL 328, p. 614 (1987)) 40 
studies have shown that this chromosome region harbors the 
gene responsible for familial adenomatous polyposis (FAP) 
and Gardner's Syndrome (GS). FAP is an autosomal- 
dominant, inherited disease in which affected individuals 
develop hundreds to thousands of adenomatous polyps, 45 
some of which progress to malignancy. GS is a variant of 
FAP in which desmoid tumors, osteomas and other soft 
tissue tumors occur together with multiple adenomas of the 
colon and rectum A less severe form of polyposis has been 
identified in which only a few (2-40) polyps develop. This 50 
condition also is familial and is linked to the same chromo- 
somal markers as FAP and GS (Leppert et aL, New England 
Journal of Medicine, VoL 322, pp, 904-908, 1990.) 
Additionally, this chromosomal region is often deleted from 
the adenomas (Vogelstein et aL, N. Engl. J. Med., VoL 319, 55 
p. 525 (1988)) and carcinomas (Vogelstein et aL, N. EngL J. 
Med., VoL 319, p. 525 (1988); Solomon et aL, Nature, VoL 
328, p. 616 (19*7); Sasaki et aL, Cancer Research, VoL 49, 

p. 4402 (1989); Delattre ct aL, Lancet, VoL 2, p. 353 (1989); 
and Ashton-Rickardt et aL, Oncogene, VoL 4, p. 1169 & 
(1989)) of patients without FAP (sporadic tumors). Thus, a 
putative suppressor gene on chromosome 5q21 appears to 
play a role in the early stages of colorectal neoplasia in beth 
sporadic and familial tumorsj 

Although the MCC gene has been identified on 5q21 as a 65 
candidate suppressor gene, it does not appear to be altered 
in FAP or GS patients. Thus there is a need in the art for 
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investigations of this chromosomal region to identify genes 
and to determine if any of such genes are associated with 
FAP and/or GS and the process of tumorigenesis. 

5 SUMMARY OF THE INVENTION 

It is an object of the present invention to provide a method 
for diagnosing and prognosing a neoplastic tissue of a 
human. 

It is another object of the invention to provide a method 
10 of detecting genetic predisposition to cancer. 

It is another object of the invention to provide a method 
of supplying wild-type APC gene function to a cell which 
has lest said gene function. 
15 E is yet another object of the invention to provide a kit for 
determination of the nucleotide sequence of APC alleles by 
the polymerase chain reaction. 

It is still another object of the invention to provide nucleic 
acid probes for detection of mutations in the human APC 
20 gene. 

It is still another object of the invention to provide a 
cDNA molecule encoding the APC gene product 
It is yet another object of the invention to provide a 
25 preparation of the human APC protein. 

fft is another object of the invention to providea method 
#v*$ / of screening for genetic predisposition to cancer} 

It is an object of me invention to provide methods of 
testing therapeutic agents for the ability to suppress neopla- 
30 sia. 

It is still another object of the invention to provide animals 
carrying mutant APC alleles. 

These and other objects of the invention are provided by 
one or more of the embodiments which are described below. 
35 In one embodiment of toe present invention a method of 
diagnosing or prognosing a neoplastic tissue of a human is 
provided comprising; detecting somatic alteration of wild- 
type APC genes or their expression products in a sporadic 
colorectal cancer tissue, said alteration indicating neoplasia 
40 of the tissue. 

In yet another embodiment a method is provided of 
detecting genetic predisposition to cancer in a human includ- 
ing familial adenomatous polyposis (FAP) and Gardner's 
Syndrome (GS), comprising: isolating a human sample 
45 selected from the group consisting of blood and fetal tissue; 
detecting alteration of wild-type APC gene coding 
sequences or their expression products from the sample, said 
alteration indicating genetic predisposition to cancer. 

In another embodiment of the present invention a method 
is provided for supplying wild-type APC gene function to a 
cell which has lost sail gene function by virtue of a mutation 
in the APC gene, comprising: introducing a wild-type APC 
gene into a cell which has lost said gene function such that 
55 said wild-type gene is easpressed in the celL 

In another embodiment a method of supplying wild-type 
APC gene function to a cell is provided comprising: intro- 
ducing a portion of a wild-type APC gene into a cell which 
has lost said gene function such that said portion is 
60 expressedfomecelUsaMpcitionenc^ 

protein which is required for non-neoplastic growth of said 
celL APC protein can also be applied to cells or adrninistered 
to animals to remediate for mutant APC genes. Synthetic 
peptides or drugs can also be used to mimic APC function 
65 in cells which have altered APC expression. 

In yet another embodiment a pair of single stranded 
primers is provided for determination of the nucleotide 
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sequence of the APC gene by polymerase chain reaction. 
The sequence of said pair of single stranded DNA primers is 
derived from chromosome 5q band 21, said pair of primers 
allowing synthesis of APC gene coding sequences. 

In stmanomer embodiment of me fave^^ 5 
probe is provided which is complementary to human wild- 
type APC gene ceding sequences and which can form 
mismatches with mutant APC genes, thereby allowing their 
detection by enzymatic or chemical cleavage or by shifts in 
electrophoretic mobility. 10 

In another embodiment of the invention a method is 
provided for detecting the presence of a neoplastic tissue in 
a human. The method comprises isolating a body sample 
from a human; detecting in said sample alteration of a 
wild-type APC gene sequence or wild4ype APC expression 15 
product, said alteration indicating the presence of a neoplas- 
tic tissue in the human. 

In still another embodiment a cDNAmolecule is provided 
which comprises the coding sequence of the APC gene. 

In even another embodiment a prej^tion cf the human 
APC protein is provided which is substantially free of ouier 
human proteins. The amino add sequence of me protein is 
shown in FIG. 3 (SEQ ID NOS: 7 and 2). 

In yet another embodiment of the invention a method is 
provided for screening for genetic predisposition to cancer, 
including familial adenomatous polyposis (FAP) and Gard- 23 
ner's Syndrome (GS), in a human. Hie method comprises: 
detecting among kindred persons the presence of a DNA^ 
polymorphism which is Imbed to a mutant APC allele in an 
individual having a genetic predisposition to cancer, said 
kindred being genetically related to the individual, ^ the 30 
presence of said polymorphism suggesting a predisposition 
to cancer. 

In another ernbodiment of the invention a method of 
testing therapeutic agents for the ability to suppress a 
neoplasticaliy transformed phenotype is provided. The 35 
method comprises: applying a test substance to acuitared 
epithelial cell which carries a mutation in an APC allele; and 
determining whether said test substance suppresses the 
neoplasticaliy transformed phenotype of the cdL 

In another entoodiment of the invention a method of 40 
testing merapeutic agents for the ability to suppress a 
neoplasticaliy transformed phenotype is provided. The 
method comprises: administering a test substance to an 
animal which carries a mutant APC allele; and cktcrmMng 
whether said test substance prevents or suppresses the ^ 
growth of tumors. 

In still other embodiments of the invention transgenic 
animals are provided The animals carry a mutant APC allele 
from a second animal species or have been genetically 
engineered to contain an insertion mutation which disrupts ^ 
an APC allele. 

The present invention provides the art wim the informa- 
tion that the APC gene, a heretofore unknown gene is, in 
fact, a target of mutational alterations on chromosome 5q21 
and that these alterations are associated with the process of 
tumorigenesis. This information allows highly specific 
assays to be performed to assess the neoplastic status of a 
particular tissue or the predisposition to cancer of an indi- 
vidual. This invention has applicability to Familial 
Adenomatous Polyposis, sporadic colorectal cancers, Gard- 
ner's Syndrome, as well as the less severe familial polyposis w 
discusses above. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1A shows an overview of yeast artificial chromo- 
some (YAC) contigs. Genetic distances between selected 65 
RFLP markers from within the contigs are shown in centi- 
Morgans. 
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FIGS. 1B-1, 1B-2 and 1B-3 show a detailed map of the 
three central contigs. The position of the six identified genes 
from within the FAP region is shown; the 5' and 3' ends of 
the transcripts from these genes have in general not yet been 
5 isolated, as indicated by the string of dots surrounding the 
bars denoting the genes' positions. Selected restriction endo- 
nuclease recognition sites are indicated. B, BssH2; S, Sstll; 
<h vM, Mlul; N, NruL 

\r ^/U?GS. 2A and 2B show the sequence of TBI (FIG. 2A) 

io/and TB2 (FIG. 2B) genes. The cDNA sequence of the TBI 
gene was determined from the analysis of 11 cDNA clones 
derived from normal colon and liver, as described in the text 
A total of 2314 bp were contained within the overlapping 
cDNA clones, defining an ORF of 424 amino acids begin- 

15 ning at nucleotide 1. Only the predicted amino acids from 
the ORF are shown. The carboxy-terminal end of the ORF 
has apparently been identified, but the 5' end of the TBI 
transcript has not yet been precisely determined^ 
The cDNA sequence of the TB2 gene was determined 

20 from the YS-39 clone derived as described in the text This 
clone consisted of 2300 bp and defined an ORF of 185 
amino acids beginning at nucleotide 1. Only the predicted 
amino acids are shown. The carboxy terminal end of the 
ORF has apparently been identified, but the 5' end of the 

25 TB2 transcript has not been precisely determined 

pQi^T O^GS.3A-3F show the sequence oftheAPC gene product 
V ( (SEQ ID NO:7). The cDNA sequence was determined 
through the analysis of 87 cDNA clones derived from 

3Q normal colon, liver, and brain. A total of 8973 bp were 
contained within overlapping cDNA clones, defining an 
ORF of 2842 amino acids. In frame stop codons surrounded 
this ORF, as described in the text, suggesting that the entire 
APC gene product was represented in the ORF illustrated. 

35 Only the predicted amino adds are shownTj 

FIGS. 4A and 4B show the local similarity between 
human APC (SEQ ID N02) and ral2 (SEQ ID NO:8) of 
yeast FIG. 4A shows amino acids 203 to 233 of APC, and 
FIG. 4B shows amino acids 453 to 481 of APC Local 

40 similarity among the APC (SEQ ID N02) and MCC genes 
(SEQ ID NO:10) genes and the m3 muscarinic acetylcholine 
receptor (SEQ ID NO:9) is shown. The region of the 
mAChR shown corresponds to mat responsible for coupling 
the receptor to G proteins. The connecting lines indicate 

45 identities; dots indicate related amino acids residues. 

FIG. 5 shows the genomic map of the 1200 Kb Notl 
fragment at the FAP locus. The Notl fragment is shown as 
a bold line. Relevant parts of the deletion chromosomes 
from patients 3214 and 3824 are shown as stippled lines. 

50 Probes used to characterize the Notl fragment and the 
deletions, and three YACs from which subclones were 
obtained, are shown below the restriction map. The chimeric 
end of YAC 183H12 is indicated by a dotted line. The 
orientation and approximate position of MCC are indicated 

55 above the map. 

FIG. 6A-6D show the DNA sequence (SEQ ID NO:3) and 
predicted amino acid sequence of DPI (TB2) (SEQ ID 
NO:4). The nucleotide numbering begins at the most 5* 
nucleotide isolated. A proposed initiation methionine (base 
60 77) is indicated in bold type. The entire coding sequence is 
presented. 

FIG. 7A, FIG. 7B-1, and FIG. 7B-2 show the arrangement 
of exons in DP2.5 (APC). (A) Exon 9 corresponds to 
nucleotides 933-1312; exon 9a corresponds to nucleotides 
65 1236-1312. The stop codon in the cDNA is at nucleotide 
8535. (B) Partial intronic sequence surrounding each exon is 
shown (SEQ ID NO: 11-38). 5' intron sequences of exons 2, 
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3, 4. 5, 6, 7, 8, 9, 10, 11, 12, 13, 14. and 15 arc shown in SEQ 
ID NOS: 12, 14, 16, 18, 20, 22, 24, 26, 28, 30, 32, 34, 36, 
38, respectively. 3* intron sequences of exons 1, 2, 3, 4, 5, 6, 
7, 8, 9, 10, 11, 12, 13, and 14 are shown in SEQ ID NOS: 
11, 13, 15, 17, 19, 21, 23, 25, 27, 29, 31, 33, 35, 37, 
respectively. 

DETAILED DESCRIPTION 

It is a discovery of the present invention that mutational 
events associated with tumorigenesis occur in a previously 
unknown gene on chromosome 5q named here the APC 
(Adenomatous Polyposis Coil) gene. Although it was pre- 
viously known that deletion of alleles on chromosome 5q 
were common in certain types of cancers, it was not known 
that a target gene of these deletions was the APC gene. 
Further it was not known that other types of mutational 
events in the APC gene are also associated with cancers. The 
mutations of the APC gene can involve gross 
rearrangements, such as insertions and deletions. Point 
mutations have also been observed. 20 

According to the diagnostic and prognostic method of the 
present invention, alteration of the wild-type APC gene is 
detected. "Alteration of a wild-type gene" according to the 
present invention encompasses all forms of mutations — ^ 
including deletions. The alteration may be due to either 
rearrangements such as insertions, inversions, and deletions, 
or to point mutations. Deletions may be of the entire gene or 
only a portion of the gene. Somatic mutations are those 
which occur only in certain tissues, eg., in the tumor tissue, ^ 
and are not inherited in the gennline. Gennline mutations 
can be found in any of a body's tissues. If only a single allele 
is somatically mutated, an early neoplastic state is indicated. 
However, if both alleles are mutated then a late neoplastic 
state is indicated. The finding of APC mutations thus pro- 3S 
vides both diagnostic and prognostic information. An APC 
allele which is not deleted (e^^ that on the sister chromo- 
some to a chromosome carrying an APC deletion), can be 
screened for other mutations, such as insertions, small 
deletions, and point mutations. It Is believed that many ^ 
mutations found in tumor tissues will be those leading to 
decreased expression of the APC gene product However, 
mutations leading to non-functional gene products would 
also lead to a cancerous state. Point mutational events may 
occur in regulatory regions, such as in the promoter of the 45 
gene, leading to loss or diminution of expression of the 
mRNA^ Point mutations may also abolish proper RNA 
processing, leading to loss of expression of the APC gene 
product 

In order to detect the alteration of the wild-type APC gene 50 
in a tissue, it is helpful to isolate the tissue free from 
surrounding normal tissues. Means for enriching a tissue 
preparation for tumor cells are known in the art For 
example, the tissue may be isolated from paraffin or cryostat 
sections. Cancer cells may also be separated from normal 55 
cells by flow cytometry. These as well as other techniques 
for separating tumor from normal cells are well known in the 
art If the tumor tissue is highly contaminated with normal 
cells, detection of mutations is more ^fflTffllt. 

Detection of point mutations may be accomplished by 60 
molecular cloning of the APC allele (or alleles) and sequenc- 
ing that allele(s) using techniques well known in the art 
Alternatively, the polymerase chain reaction (PCR) can be 
used to amplify gene sequences directly from a genomic 
DNA preparation from the tumor tissue. The DNA sequence 65 
of the amplified sequences can then be determined. The 
polymerase chain reaction itself is well known in the art. 
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See, e.g., Saiki et aL, Science, Vol 239, p. 487, 1988; U.S. 
Pat No. 4,683,203; and U.S. Pat No. 4,683,195. Specific 
primers which can be used in order to amplify the gene will 
be discussed in more detail below. The ligase chain reaction, 
5 which is known in the art. can also be used to amplify APC 
sequences. See Wu et al., Genomics. Vol. 4, pp. 560-569 
(1989). In addition, a technique known as allele specific 
PCR can be used. (See Ruano and Kidd, Nucleic Acids 
Research, VoL 17, p. 8392, 1989.) According to this 
10 technique, primers are used which hybridize at their J ends 
to a particular APC mutation. If the particular APC mutation 
is not present, an amplification product is not observed. 
Amplification Refractory Mutation System (ARMS) can 
also be used as disclosed in European Patent Application 
15 Publication No. 0332435 and in Newton et al., Nucleic 
Acids Research, VoL 17, p.7, 1989. Insertions and deletions 
of genes can also be detected by cloning, sequencing and 
amplification. In addition, restriction fragment length poly- 
morphism (RFLP) probes for the gene or surrounding 
20 marker genes can be used to score alteration of an allele or 
an insertion in a polymorphic fragment Such a method is 
particularly useful for screening among kindred persons of 
an affected individual for the presence of the APC mutation 
found in that individual Single stranded conformation poly- 
25 moronism (SSCP) analysis can also be used to detect base 
change variants of an allele. (Orita et al., Proc. NatL Acad. 
ScL USA VoL 86, pp. 2766-2770, 1989, and Genomics, VoL 
5, pp. 874-379, 1989.) Other techniques for detecting inser- 
tions and deletions as are known in the art can be used. 
7* ^Alteration of wild-type genes can also be detected on the 
basis of the alteration of a wild-type expression product of 
the gene. Such expression products include bom the APC 
mRNA as well as the APC protein product The sequences 



% of these products are shown in FIG. 3. Point mutations may 



35 be detected by amplifying and sequencing the mRNA or via 
molecular cloning of cDNA made from the mRNA. The 
sequence of the cloned cDNA can be detennined using DNA 
sequencing techniques which are well known in the art The 
cDNA can also be sequenced via the polymerase chain 
40 reaction (PCR) which will be discussed in more detail 
belowTl 

p. %\ mm y r (Mismatches, according to the present invention are 
^\ r hybridized nucleic acid duplexes which are not 100% 
* homologous. The lack of total homology may be due to 

45 deletions, insertions, inversions, substitutions or frameshift 
mutations. Mismatch detection can be used to detect point 
mutations in the gene or its mRNA product While these 
techniques are less sensitive than sequencing, they axe 
simpler to perform on a large number of tumor samples. An 
so example of a rtiicmatrh cleavage technique is the RNase 
protection method, which is described in detail in Winter et 
aL, Proc, NatL Acad. ScL USA, VoL 82, p. 7575, 1985 and 
Meyers et aL, Science, VoL 230, p. 1242, 1985. In the 
practice of the present invention the method involves me use 
55 of a labeled riboprobe which is complementary to the human 
wild-type APC gene coding sequence. The riboprobe and 
either mRNA or DNA isolated from the tumor tissue are 
annealed (hybridized) together and subsequently digested 
with the enzyme RNase A which is able to detect some 
60 mismatches in a duplex RNA structure. If a mismatch is 
detected by RNase A, it cleaves at the site of the mismatch. 
Thus, when the annealed RNApreparation is separated on an 
electrophoretic gel matrix, if a mismatch has been detected 
and cleaved by RNase A, an RNA product will be seen 
65 which is smaller than the full-length duplex RNA for the 
riboprobe and the mRNA or DNA. The riboprobe need not 
be the full length of the APC mRNA or gene but can be a 
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segment of either, n the riboprobe comprises only a segment 
of the APC mRNA or gene it will be desirable to use a 
number of these probes to screen the whole mRNA sequence 



In similar fashion, DNA probes can be used to detect 5 
mismatches, through enzymatic or chemical cleavage. See, 
e.g., Cotton et at, Proc. Nad. Acad. ScL USA, VoL 85, 4397, 
1988; and Shenk et al., Proc. Natl. Acad. ScL USA, Vol. 72, 
p. 989, 1975. Alternatively, mismatches can be detected by 
shifts in the electrophoretic mobility of mismatched 10 
duplexes relative to matched duplexes. See, e.g., Cariello, 
Human Genetics, Vol. 42, p. 726, 1988. With either ribo- 
probes or DNA probes, the cellular mRNA or DNA which 
might contain a mutation can be amplified using PGR (see 
below) before hybridization. Changes in DNA of the APC 15 
gene can also be detected using Southern hybridization, 
especially if the changes are gross rearrangements, such as 
deletions and insertions. 

DNA sequences of the APC gene which have been 
amplified by use of polymerase chain reaction may also be 20 
screened using allele-specific probes. These probes are 
nucleic acid oligomers, each of which contains a region of 
the APC gene sequence harboring a known mutation. For 
example, one oligomer may be about 30 nucleotides in 
length, corresponding to a portion of the A PC gene 25 
sequence. By use of a battery of such allele-specific probes, 
PCR amplification products can be screened to identify the 
presence of a previously identified mutation in the APC 
gene. Hybridization of allele-specific probes with amplified 
APC sequences can be performed, for example, on a nylon 30 
filter. Hybridization to a particular probe under stringent 
hybridization conditions indicates the presence of the same 
mutation in the tumor tissue as in the allele-specific probe. 

Alteration of APC mRNA expression can be detected by 
any technique known in the art These include Northern blot 35 
analysis, PCR amplification and RNase protection. Dimin- 
ished mRNA expression indicates an alteration of the wild- 
type APC gene. Alteration of wild-type APC genes can also 
be detected by screening for alteration of wild-type APC 
protein. For example, monoclonal antibodies immonoreac- 40 
tive with APC can be used to screen a tissue. Lack of cognate 
antigen would indicate an APC mutation. Antibodies spe- 
cific for products of mutant alleles could also be used to 
detect mutant APC gene product Such immunological 
assays can be done in any convenient format known in the 45 
art These include Western blots, inimunohistochemical 
assays and FITS A assays. Any means for detecting an 
altered APC protein can be used to detect alteration of 
wild-type APC genes. Functional assays can be used, such as 
protein binding determinations. For example, it is believed so 
that APC protein oligomerizes to itself and/or MCC protein 
or binds to a G protein. Thus, an assay for the ability to bind 
to wild type APC or MCC protein or that G protein can be 
employed. In addition, assays can be used which detect APC 
biochemical function. It is believed that APC is involved in 55 
phospholipid metabolism. Thus, assaying the enzymatic 
products of the involved phospholipid metabolic pathway 
can be used to determine APC activity. Finding a mutant 
APC gene product indicates alteration of a wild-type APC 
gene. eo 

Mutant APC genes or gene products can also be detected 
in other human body samples, such as, serum, stool, urine 
and sputum. The same techniques discussed above for 
detection of mutant APC genes or gene products in tissues 
can be applied to other body samples. Cancer cells are 65 
sloughed off from tumors and appear in such body samples. 
In addition, the APC ge ne product itself may be secreted into 
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the extracellular space and found in these body samples even 
in the absence of cancer cells. By screening such body 
samples, a simple early diagnosis can be achieved for many 
types of cancers. In addition, the progress of chemotherapy 
5 or radiotherapy can be monitored more easily by testing 
such body samples for mutant APC genes or gene products. 

The methods of diagnosis of the present invention are 
applicable to any tumor in which APC has a role in tum- 
origenesis. Deletions of chromosome arm 5q have been 
observed in tumors of lung, breast colon, rectum, bladder, 
liver, sarcomas, stomach and prostate, as well as in leuke- 
mias and lymphomas. Thus these are likely to be tumors in 
which APC has a role. The diagnostic method of the present 
invention is useful for clinicians so that they can decide upon 
an appropriate course of treatment For example, a tumor 
15 displaying alteration of both APC alleles miglit suggest a 
more aggressive therapeutic regimen man a tumor display- 
ing alteration of only one APC allele. 
The primer pairs of the present invention are useful for 
2P determination of the nucleotide sequence of a particular 
APC allele using the polymerase chain reaction. The pairs of 
single stranded DNA primers can be annealed to sequences 
within or surrounding the APC gene on chromosome 5q in 
order to prime amplifying DNA synthesis of the APC gene 
„ itself. A complete set of these primers allows synthesis of all 
of the nucleotides of the APC gene coding sequences, i.e., 
the exons. The set of primers preferably allows synthesis of 
both intron and exon sequences. Allele specific primers can 
also be used. Such primers anneal only to particular APC 
3Q mutant alleles, and thus will only amplify a product in the 
presence of the mutant allele as a template. 

In order to facilitate subsequent cloning of amplified 
sequences, primers may have restriction enzyme site 
sequences appended to their 5* ends. Thus, all nucleotides of 
35 the primers are derived from APC sequences or sequences 
adjacent to APC except the few nucleotides necessary to 
form a restriction enzyme site. Such enzymes and sites are 
well known in the art The primers themselves can be 
synthesized using techniques which are well known in the 
40 art Generally, the primers can be made using oligonucle- 
otide synthesizing machines which are commercially avail- 
able. Given the sequence of the APC open reading frame 
shown in FIG. 3 (SEQ ID NO:l), design of particular 
primers is well within the skill of the art 
45 The nucleic acid probes provided by the present invention 
are useful for a number of purposes. They can be used in 
Southern hybridization to genomic DNA and in the RNase 
protection method for detecting point mutations already 
discussed above. The probes can be used to detect PGR 
50 amplification products. They may also be used to detect 
mismatches with the APC gene or mRNA using other 
techniques. Mismatches can be detected using either 
enzymes (e.g., SI nuclease), chemicals (e.g., hydYcxyianrine 
or osmium tetroxide and piperidine), or changes in electro- 
55 phoretic mobility of mismatched hybrids as compared to 
totally matched hybrids. These techniques are known in the 
art See, Cotton, supra, Shenk, supra, Myers, supra, Winter, 
supra, and Novack et at, Proa Natl. Acad. Sd. USA, VoL 
83, p. 586, 1986. Generally, the probes are complementary 
eo to APC gene coding sequences, although probes to certain 
introns are also contemplated. An entire battery of nucleic 
acid probes is used to compose a kit for detecting alteration 
of wild-type APC genes. The kit allows for hybridization to 
the entire APC gene. The probes may overlap with each 
65 other or be contiguous. 

If a riboprobe is used to detect mismatches with mRNA, 
it is complementary to the mRNA of the human wild-type 
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APC gene. The riboprobe thus is an anti-sense probe in that 
it does not code for the APC protein because it is of the 
opposite polarity to the sense strand. The riboprobe gener- 
ally will be labeled with a radioactive, colorimetric, or 
fluorometric material, which can be accomplished by any 5 
means known in the art If the riboprobe is used to detect 
mismatches with DNA it can be of either polarity, sense or 
anti-sense. Similarly, DNA probes also may be used to detect 
mismatches. 

Nucleic acid probes may also be complementary to 10 
mutant alleles of the APC gene. These are useful to detect 
similar mutations in other patients on the basis of hybrioV 
ization rather man mismatches. These are discussed above 
and referred to as allele-specific probes. As mentioned 
above, the A PC probes can also be used in Southern 15 
hybridizations to genomic DNA to detect gross chromo- 
somal changes such as deletions and insertions. The probes 
can also be used to select cDNA clones of APC genes from 
tumor and normal tissues. In addition, the probes can be used 
to detect APC mRNA in tissues to determine if expression is 20 
diminished as a result of alteration of wild-type APC genes. 
<& £ /Recording to the present invention a method is also 
provided of supplying wild-type APC function to a cell 
which carries mutant APC alleles. Supplying such function 
should suppress neoplastic growth of the recipient cells. The 25 
wild-type APC gene or a part of the gene may be introduced 
into the cell in a vector such that the gene remains extra- 
chromosomaL In such a situation the gene will be expressed 
by the cell from the extachromosomal location. If a gene 
portion is introduced and expressed in a cell carrying a 30 
mutant APC allele, the gene portion should encode a part of 
the APC protein which is required for non-ncoplastic growm 
of the celL More preferred is the situation where the wild- 
type APC gene or a part of it is introduced into the mutant 
cell in such a way mat it recombines with the endogenous 35 
mutant APC gene present in the celL Such recombination 
requires a double recombination event which results in me 
correction of the APC gene mutation. Vectors for introduc- 
tion of genes beth for recombination and for extracfaromo- 
somal maintenance are known in the art and any suitable 40 
vector may be used. Methods for introducing DNA into cells 
such as dectroporation, calcium phosphate co-rredpitation 
and viral transduction are known in the art and the choice of 
method is within the competence of the routineer. Cells 
transformed with the wild-type A PC gene can be used as 45 
model systems to study cancer remission and drug treat- 
ments .which promote such rernissionT} 
[ ^7 Similarly, cells and animals which carry a mutant APC 
^ * allele can be used as model systems to study and test for 

substances which have potential as therapeutic agents. The 50 
cells are typically cultured epithelial cells. These may be 
isolated from individuals with APC mutations, either 
somatic or gennline. Alternatively, the cell line can be 
engineered to carry the mutation in the APC allele. After a 
test substance is applied to the cells, the neoplastically 55 
transformed pheno-type of the cell will be determined. Any 
trait of neoplastically transformed cells can be assessed, 
including anchorage-independent growth, tumorigeniciry in 
nude mice, invasiveness of cells, and growm factor depen- 
dence. Assays for each of these traits are known in the artjeo 

Animals for testing therapeutic agents can be selected 
after mutagenesis of whole animals or after treatment of 
gennline cells or zygotes. Such treatments include insertion 
of mutant A PC alleles, usually from a second animal 
species, as well as insertion of disrupted homologous genes. 65 
Alternatively, the endogenous APC gene(s) of the animals 
may be disrupted by insertion or deletion mutation. After test 
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substances have been administered to the animals, the 
growth of tumors must be assessed. If the test substance 
prevents or suppresses the growth of tumors, then the test 
substance is a candidate therapeutic agent for the treatment 
5 of FAP and/or sporadic cancers. 
r$£ Polypeptides which have APC activity can be supplied to 
cells wn ich carry mutant or missing APC alleles. The 
sequence of the APC protein is disclosed in FIG. 3 (SEQ ID 
NO:7). These two sequences differ slightly and appear to be 
io indicate the existence of two different forms of the APC 
protein. Protein can be produced by expression of the cDNA 
sequence in bacteria, for example, using known expression 
vectors. Alternatively, APC can be extracted from APC- 
producing r"*™™* 1 ^" cells such as brain cells. In addition, 
is the techniques of synthetic chemistry can be employed to 
synthesize APC protein. Any of such techniques can provide 
the preparation of the present invention which comprises the 
APC protein. The preparation is substantially free of other 
human proteins. This is most readily accomplished by 
20 synthesis in a microorganism or in vitro/} 

Active APC molecules can be introduced into cells by 
microinjection or by use of liposomes, for example. 
Alternatively, some such active molecules may be taken up 
by cells, actively or by diffusion. Extracellular application of 
25 APC gene product may be sufficient to affect tumor growth. 
Supply of molecules with APC activity should lead to a 
partial reversal of the neoplastic state. Other molecules with 
APC activity may also be used to effect such a reversal, for 
example peptides, drugs, or organic compounds. 
30 The present invention also provides a preparation of 
antibodies immunoreactive with a human APC protein. The 
antibodies may be polyclonal or monoclonal and may be 
raised against native APC protein, APC fusion proteins, or 
mutant APC proteins. The antibodies should be immnnore- 
35 active with APC epitopes, preferably epitopes not present on 
other human proteins. In a preferred einbodiment of the 
invention the antibodies will inimunoprecipitate APC pro- 
teins from solution as well as react with APC protein on 
Western or immunoblots of polyacrylamide gels. In another 
40 preferred embodiment, the antibodies will detect APC pro- 
teins in paraffin or frozen tissue sections, using immunocy- 
tochemical techniques. Techniques for raising and purifying 
antibodies are well known in the art and any such techniques 
may be chosen to achieve the preparation of the invention. 
45 Predisposition to cancers as in FAP and GS can be 
ascertained by testing any tissue of a human for mutations of 
the APC gene. For example, a person who has inherited a 
germline APC mutation would be prone to develop cancers. 
50 This can be determined by testing DNA from any tissue of 
the person's body. Most simply, blood can be drawn and 
DNA extracted from the cells of the blood. In addition, 
prenatal diagnosis (an be accomplished by testing fetal cells, 
placental cells, or amniotic fluid for mutations of the APC 
gene. Alteration of a wild-type APC allele, whether for 
example, by point mutation or by deletion, can be detected 
by any of the means discussed above. 

Molecules of cDNA according to the present invention are 
intron-fiee, APC gene ceding molecules. They can be made 
60 by reverse transcriptase using the APC mRNA as a template. 
These molecules can be propagated in vectors and cell lines 
as is known in the art Such molecules have the sequence 
shown in SEQ ID N03. The cDNA can also be made using 
the techniques of synthetic chemistry given the sequence 
65 disclosed herein. 
p> £A short region of homology has been identified between 
Vy( APC and the human m3 muscarinic acetylcholine receptor 
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(mAChR). This homology was largely confined to 29 resi- 
dues in which 6 out of 7 amino acids (EL(GorA)GLQA) 
were identical (See FIG. 4 (SEQ ID NO: 9)). Initially, it was 
not known whether this homology was significant, because 
many other proteins had higher levels of global homology 5 
(though few had six out of seven contiguous amino acids in 
common). However, a study on the sequence elements 
controlling G protein activation by mAChR subtypes 
(Lechleiter et aL, EMBO J., p. 4381 (1990)) has shown that 
a 21 amino acid region from the m3 mAChR completely 10 
mediated G protein specificity when substituted for the 21 
amino acids of m2 mA ChR at the analogous protein 
position. These 21 residues overlap the 19 amino acid 
homology between APC and m3 mA ChRT\ 

This connection between APC and the G protein activat- 15 
ing region of mAChR is intriguing in light of previous 
investigations relating G proteins to cancer. Far example, the 
RAS oncogenes, which are often mutated in colorectal 
cancers (Vogelstein, et aL, N. Engl. J. Med., VoL 3 19, p. 525 

(1988) ; Bos et aL, Nature Vol. 327, p. 293 (1987)), are 2° 
members of the ( 1 protein family (Bourne, et al, Nature, VoL 
348, p. 125 (1990)) as is an in vitro transformation suppres- 
sor (Noda et al., Proc. Natl Acad. ScL USA, VoL 86, p. 162 

(1989) ) and genes mutated in hormone producing tumors 
(Candis et al., Nature, VoL 340, p. 692 (1989); Lyons et aL, & 
Science, VoL 249, p. 655 (1990)). Additionally, the gene 
responsible for neurofibromatosis (presumably a tumor sup- 
pressor gene) has been shown to activate the GTPase 
activity of RAS (Xu et aL, Cell, VoL 63, p. 835 (1990); 
Martin et al.. Cell, VoL 63, p. 843 (1990); Ballester et aL, 30 
Cell, VoL 63, p. 851 (1990)). Another interesting link 
between G proteins and colon cancer involves the drug 
sulindac. This agent has been shown to inhibit the growth of 
benign colon tumors in patients with PAP, presumably by 
virtue of its activity as a cyclooxygenase inhibitor (Waddell 35 
et aL, J. Surg. Oncology 24(1), 83 (1983); Wadell, et aL, Am. 

J. Surg., 157(1), 175 (1989); Chameau et aL, Gastroenter- 
ologieClinique atBiologique 14(2), 153 (1990)). Cyclooxy- 
genase is required to convert arachidonic add to prostag- 
landins and other biologically active molecules. G proteins 40 
are known to regulate phospholipase A2 activity, which 
generates arachidonic add from phospholipids (Role et aL, 
Proc. Natl. Acad. ScL USA, VoL 84, p. 3623 (1987); Kurachi 
et aL, Nature, VoL 337, 12 555 (1989)). Therefore we 
propose that wild-type APC protein functions by interacting 45 
with a G prottin and is involved in phospholipid metabo- 
lism. 

The following are provided for exemplification purposes 
only and are not intended to limit the scope of the invention ^ 
which has been described in broad terms above. 



f\^> fThis example demonstrates the isolation of a 5.5 Mb 
9^ ( re^on of human DNAliiike^^ 55 



identified in this region, all of which are expres sed in normal 
colon cells and in colorectal, lung, ad bladder tumorsT} 



The cosmid markers YN5.64 and YN5.48 have previously 
been shown to delimit an 8 cM region containing the locus 
for FAP (Nakamura et aL, Am. J. Hum, Genet VoL 43, p. 60 
638 (1988)). Further linkage and pulse-field gel electro- 
phoresis (PFGE) analysis with additional markers has shown 
that the FAP locus is contained within a 4 cM region 
bordered by cosmids EF5.44 and L5.99. In order to isolate 
dones representing a significant portion of this locus, a yeast 65 
artificial chromosome (YAQ library was screened with 
various 5q21 markers. Twenty-one YAC clones, distributed 
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within six contigs and including 5.5 Mb from the region 
between YN5.64 and YN5.48, were obtained (FIG. 1A). 
/i & £3iree contigs encompassing approximately 4 Mb were 

5W* .CA^ contained within the central portion of this region. The 

5 YACso>nstimtiiigmesecx>ntigs,togemerwimthemarkeK 
used for their isolation and orientations, are shown in FIG. 
1. These YAC contigs were obtained in the following way. 
To initiate each contig, the sequence of a genomic marker 
cloned from chromosome 5q21 was determined and used to 
design primers for PCR. PCR was then carried out on pools 
of YAC clones distributed in microtiter trays as previously 
described (Anand et aL, Nucleic Acids Research, Vol 18, p. 
195 1 (1980)). Individual YAC clones from the positive pools 
were identified by further PCR or hybridization based 
assays, and the YAC sizes were determined by FFGEJ 
15 fib extend the areas covered by the original YAC clones, 
O aA C T7 "chromosomal walking" was performed. For this purpose, 
CT < YAC termini were isolated by a PCR based method and 

sequenced (Riley et aL, Nucleic Acids Research, VoL 18, p. 
2887 (1990)). PCR primers based on these sequences were 
20 then used to rescreen the YAC library. For example, the 
sequence from an intron of the FER gene (Hao et aL, MoL 
CelL BioL, VoL 9, p. 1587 (1989)) was used to design PCR 
primers for isolation of the 28EC1 and 5EH8 YACs. The 
„ termini of the 28EC1 YAC were sequenced to derive mark- 
ers RHE28 and LHE28, respectively. The sequences of these 
two markers were then used to isolate YAC clones 15CH12 
(from RHE28) and 40CF1 and 29EF1 (from LHE28). These 
five YACs formed a contig encompassing 1200 kb (contig 
30 1,FIG.1B)7I 

Similarly, contig 2 was initiated using cosmid N5.66 
sequences, and contig 3 was initiated using sequences both 
from the MCC gene and from cosmid EF5.44. A walk in the 
telomeric direction from YAC 14FH1 and a walk in the 
« opposite direction from YAC 39GG3 allowed connection of 
the initial contig 3 clones through YAC 37HG4 (FIG. IB). 
YAC37HG4 was deposited at the National Collection of 
Industrial and Marine Bacteria (NQMB), P.O. Box 31, 23 
St Machar Drive, Aberdeen AB2 1RY, Scotland, under 
40 Accession No. 40353 on Dec. 17, 1990. 
A iD^x Multipoint linkage analysis with the various markers used 
<yOt • f to (Serine the contigs, combined with PFGE analysis, showed 
that contigs 1 and 2 were centromeric to contig 3. These 
contigs were used as tools to orient and/or identify genes 
45 which might be responsible for FAR Six genes were found 
to lie within mis cluster of YACs, as follows:^ 

Contig #1: FER— The FER gene was discovered through 
its homology to the viral oncogene ABL (Hao et aL, supra). 
It has an intrinsic tyrosine kinase activity, and in situ 
50 hybridization with an FER probe showed mat the gene was 
located at 5qll-23 (Morris et aL, Cytogenet CelL Genet., 
VoL 53, p. 4, (1990)). Because of the potential role of this 
oncogene-related gene in neoplasia, we decided to evaluate 
it further with regards to the FAP locus. A human genomic 
55 clone from FER was isolated (MF 23) and used to define a 
restriction fragment length polymorphism (RFLP), and the 
RFLP in turn used to map FER by linkage analysis using a 
panel of three generation families. This showed that FER 
was very tightly linked to previously defined polymorphic 
60 markers for the FAP locus. The genetic mapping of FER was 
complemented by physical mapping using the YAC clones 
derived from FER sequences (FIG. IB). Analysis of YAC 
contig 1 showed that FER was within 600 kb of cosmid 
marker M5.28, which maps to within 13 Mb of cosmid 
65 L5 99 by PFGE of human genomic DN A. Th us, the YAC 
mapping results were consistent with the FER linkage data 
and PFGE analyses. 



ft */ {fontig 2: TBI— TBI was identified through a cross- 
( hybridization approach. Exons of genes are often evolution- 
arily conserved while introns and intergenie regions are 
much less conserved. Thus, it a human probe cross- 
hybridizes strongly to the DNA from non-primate species, 5 
mere is a reasonable chance that it contains exon sequences. 
Subclones of the cosmids shown in FIG. 1 were used to 
screen Southern blots containing rodent DNA samples. A 
subclone of cosmid N5.66 (p 5.66-4) was shown to strongly 
hybridize to rodent DNA, and this clone was used to screen 10 
cDNA libraries derived from normal adult colon and fetal 
liver. The ends of the initial cDNA clones obtained in this 
screen were then used to extend the cDNA sequence. 
Eventually, 11 cDNA clones were isolated, covering 2314 
bp. The gene detected by these clones was named TBI. 15 
Sequence analysis of the overlapping clones revealed an 
open reading frame (ORF) that extended for 1302 bp starting 
from the most 5* sequence data obtained (FIG. 2A). If mis 
entire open reading frame were translated, it would encode 
434 amino acids (SEQ ID NO:5). The product of this gene 
was not globally homologous to any other sequence in the 
current database but showed two significant local similari- 
ties to a family of ADP, ATP carrier/translocator proteins and 
mitochondrial brown fat uncoupling proteins which are 
widely distributed from yeast to mammals. These conserved 25 
regions of TBI (underlined in FIG. 2A) may define a 
predictive motif for this sequence family. In addition, TBI 
appeared to contain a signal peptide (or mitochondrial 
targeting sequence) as well as at least 7 transmembrane 
domainO 30 

Contig 3: MCC, TB2, SRP and APC— The MCC gene 
was also discovered through a cross-hybridization approach, 
as described previously (Kinzler et aL, Science VoL 251, p. 
1366 (1991)). The MCC gene was considered a candidate 
for causing FAP by virtue of its tight genetic linkage to FAP 35 
susceptibility and its somatic mutation in sporadic colorectal 
carcinomas. However, mapping experiments suggested that 
the ceding region of MCC was approximately 50 kb proxi- 
mal to the centromeric end of a 200 kb deletion found in an 
FAP patient MCC cDN A probes detected a 10 kb mRNA 40 
transcript on Northern blot analysis of which 4151 bp, 
including the entire open reading frame, have been cloned. 
Although the 3* non-translated portion or an alternatively 
spliced form of MCC might have extended into this deletion, 
it was possible that the deletion did not affect the MCC gene 45 
product We therefore used MCC sequences to initiate a 
YAC contig, and subsequently used the YAC clones to 
identify genes 50 to 250 kb distal to MCC mat might be 
contained within the deletion. 

In a first approach, the insert from YAC24ED6 (FIG. IB) 50 
was radiolabeled and hybridized to a cDNA library from 
normal colon. One of the cDNA clones (YS39) identified in 
this manner detected a 3.1 kb mRNA transcript when used 
as a probe for Northern blot hybridization. Sequence analy- 
sis of the YS39 clone revealed that it encompassed 2283 55 
nucleotides and contained an ORF that extended for 555 bp 
from the most 5* sequence data obtained. If all of mis ORF 
were translated, it would encode 185 amino acids (SEQ ID 
NO:6) (FIG. 2B). The gene detected by YS39 was named 
TB2. Searches of nucleotide and protein databases revealed 60 
that the TB2 gene was not identical to any previously 
reported sequences nor were there any striking similarities. 

Another clone (YS 11) identified through the YAC 24ED6 
screen appeared to contain portions of two distinct genes. 
Sequences from one end of YS11 were identical to at least 65 
180 bp of the signal recognition particle protein SRP19 
(Lingelbach et al. Nucleic Acids Research, VoL 16, p. 943 1 
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(1988). A second ORF, from the opposite end of clone YS11, 
proved to be identical to 78 bp of a novel gene which was 
independently identified through a second YAC-based 
approach. For the latter, DNA from yeast cells containing 
5 YAC 14FH1 (FIG. IB) was digested with EcoRI and sub- 
cloned into a plasmid vector. Piasmids mat contained human 
DNA fragments were selected by colony hybridization using 
total human DNA as a probe. These clones were then used 
to search for cross-hybridizing sequences as described above 
10 forTBl, and the cross-hybridizing clones were subsequently 
used to screen cDNA libraries. One of the cDNA clones 
discovered in this way (FIB 8) contained a long ORF (2496 
bp), 78 bp of which were identical to the above-noted 
sequences in YS1 1. The ends of the FH38 cDNA clone were 
15 then used to initiate cDNA walking to extend the sequence. 
Eventually, 85 cDNA clones were isolated from normal 
colon, brain and liver cDNA libraries and found to encom- 
pass 8973 nucleotides of contiguous transcript The gene 
corresponding to mis transcript was named APC When used 
20 as probes for Northern blot analysis, APC cDNA clones 
hybridized to a single transcript of approximately 93 kb, 
suggesting that the great majority of the gene product was 
represented in the cDNA clones obtained. Sequences from 
the 5* end of the APC gene were found in YAC 37HG4 but 
25 not in YAC 14FH1. However, the 3* end of the APC gene 
was found in 14FH1 as well as 37HG4. Analogously, the 5* 
end of the MCC ceding region was found in YAC clones 
19AA9 and 266C3 but not 24ED6 or 14FH1, while the 3' 
end displayed the opposite pattern. Thus, MCC and APC 
30 transcription units pointed in opposite directions, with the 
direction of transcription going from centromeric to telom- 
eric in the case of MCC, and telomeric to centromeric in the 
case of APC PFGE analysis of YAC DNA digested with 
various restriction endonucleases showed that TB2 and SRP 
35 were between MCC and APC, and that the 3' ends of the 
ceding regions of MCC and APC were separated by approxi- 
mately 150 kb (FIG. IB). 
/Sequence analysis of die APC cDNA clones revealed an 
open reading frame of 84135 nucleotides. The 5* end of the 
40 ORF contained a methionine codon (codon 1) mat was 
preceded by an in-frame stop codon 9 bp upstream, and the 
3* end was followed by several in-frame stop codons. The 
protein produced by initiation at codon 1 would contain 
2,842 amino acids (SEQ ID NO:7) (FIG. 3). The results of 
45 database searching with the APC gene product were quite 
complex due to the presence of large segments with locally 
biased amino acid compositions. In spite of this, APC could 
be roughly divided into two domains. The N-terminal 25% 
of the protein had a high content of leucine residues (12%) 
50 and showed local sequence similarities to myosins, various 
intermediate filament proteins (e.g., desmin, vimentin, 
neurofilaments) and Drosophila armadillo/human plakoglo- 
bin. The latter protein is a component of adhesive junctions 
(desmosomes) joining epithelial cells (Franke et aL, Proa 
55 Natl. Acad. ScL U.SA-, VoL 86, p. 4027 (1989);Perfer et aL, 
Cell, Vol. 63, p. 1167 (1990)) The C-terminal 75% of APC 
(residues 731-2832) is 17% serine by composition with 
serine residues more or less uniformly distributed. This large 
domain also contains local concentrations of charged 
60 (mostly acidic) and proline residues. There was no indica- 
tion of potential signal peptides, transmembrane regions, or 
nuclear targeting signals in APC, suggesting a cytoplasmic 
localization?! 

To detect short similarities to APC, a database search was 
: 65 perfomedusmgmePAM^inatrix(AltschuL J.MoLBio., 
Vol. 219, p. 555 (1991). Potentially interesting matches to 
several proteins were found. The most suggestive of these 
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involved the ral2 gene product of yeast, which is implicated 
in the regulation of ras activity (Fukul et aL, MoL CelL BioL, 
Vol. 9, p. 5617 (1989)). Little is known about how ra!2 might 
interact with ras but it is interesting to note the positively- 
charged character of this region in the context of the 5 
negatively-charged GAP interaction region of ras. A specific 
electrostatic interaction between ras and GAP-reiated pro- 
teins has been proposed. 
-» Qjecause of the proximity of the MCC and APC genes, and 
Q^lrt- *G\ f $~ I mc fact ttat both am implicated in colorectal tumorigenesis, 10 
^ we searched for similarities between the two predicted 

proteins. Bourne has previously noted that MCC has the 
potential to form alpha helical coiled coils (Nature, VoL 35 1, 
p. 188 (1991). Lupas and colleagues have recently devel- 
oped a program for predicting coiled coil potential from 
primary sequence data (Science, VoL 252, p. 1162 (1991) 
and we have used their program to analyze bom MCC and 
AFC Analysis of MCC indicated a discontinuous pattern of 
coiled-coil domains separated by putative "hinge*' or 
"sparer" regions similar to those seen in laminin and other 20 
intermediate filament proteins. Analysis of the APC 
sequence revealed two regions in the N-terminal domain 
which had strong coiled con-forming potential, and these 
regions corresponded to those that showed local similarities 
with myosin and IF proteins on database searching. In 25 
addition, one other putative coiled coil region was identified 
in the central region of APC. The potential for both APC and 
MCC to form coiled coils is interesting in that such struc- 
tures often mediate homo- and hetero-oligomerizationn 

Finally, it had previously been noted that MCC shared a 30 
short similarity with the region of the m3 muscarinic ace- 
tylcholine receptor (mAChR) known to regulate specificity 
of G-protein coupling. The APC gene also contained a local 
similarity to the region of the m3 mAChR (SBQID NO:9) 
that overlapped with the MCC similarity (SEQ ID NO:10) 35 
(FIG. 4B). Although the similarities to ra!2 (SEQ ID NO:8) 
(FIG. 4A) and m3 mAChR (SEQ ID NO 3) (FTCL 4B) were 
not statistically significant, they were fnrrigring in light of 
previous observations relating G-proteins to neoplasia. ^ 

Each of the six genes described above was expressed in 
normal colon mucosa, as indicated by their representation in 
colon cDNA libraries. To study expression of the genes in 
neoplastic colorectal epithelium, we employed reverse 
transcription-polymerase chain reaction (PGR) assays. 4S 
Primers based on the sequences of PER, TBI, TB2, MCC, 
and APC were each used to design primers for PCR per- 
formed with cDNA templates. Each of these genes was 
found to be expressed in normal colon, in each of ten cell 
lines derived from colorectal cancers, and in tumor cell lines ^ 
derived from lung and bladder tumors. The ten colorectal 
cancer cell lines included eight from patients with sporadic 
CRC and two from patients with FAR 
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This example demonstrates a genetic analysis of the role 
of the FER gene in FAP and sporadic colorectal cancers. 

We considered FER as a candidate because of its prox- 
imity to the FAP locus as judged by physical and genetic 
criteria (see Example 1), and its homology to known 60 
tyrosine kinases with oncogenic potential. Primers were 
designed to PCR-amplify the complete coding sequence of 
FER from the RNA of two colorectal cancer cell lines 
derived from FAP patients. cDNA was generated from RNA 
and used as a template for PCR. The primers used were 65 
S'-AGAAGGATCCCTTGrGCAGrGrGGA-S* (SEQ ID 
NOS5) and 5'-GACAG Q ATCCTGAAGCTGAGTTTG-3 ' 
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(SEQ ID NO:96). The underlined nucleotides were altered 
from the true FER sequence to create BamHI sites. The cell 
lines used were JW and Difi, both derived from colorectal 
cancers of FAP patients. (C. Paraskeva, B. G. Buckle, D. 

5 Sheer, C. B. Wigley, InL J. Cancer 34, 49 (1984); M. E. 
Gross et aU Cancer Res. 51, 1452 (1991). The resultant 2554 
basepair fragments were cloned and sequenced in their 
entirety. The PCR products were cloned in the BamHI site 
of Bluescript SK (Stratagene) and pools of at least 50 clones 

10 were sequenced en masse using T7 polymerase, as described 
in Nigra et aL, Nature 342, 705 (1989). 

Only a single conservative amino acid change 
(GTG-»CTG, creating a val to leu substitution at codon 439) 
was observed. The region surrounding this codon was then 

15 amplified from the DNA of individuals without FAP and this 
substitution was found to be a common polymorphism, not 
specifically associated with FAP. Based on these results, we 
considered it unlikely (though still possible) the FER gene 
was responsible for FAP. To amplify the regions surrounding 

20 codon 439, the following primers were used: 
5'-T<^GAAACTGCTGAAGAG-3' (SEQ ID NO:97) and 
S'-GGAATAArrAGGrCrCCAA-S* (SEQ ID NO:98). PCR 
products were digested with PstI, which yields a 50 bp 
fragment if codon 439 is leucine, but 26 and24 bp fragments 

25 if it is valine. The primers used for sequencing were chosen 
from the FER cDNA sequence in Hao et aL, supra. 

EXAMPLE 3 

— This example demonstrates the genetic analysis of MCC, 
TB2, SRP and APC in FAP and sporadic colorectal tumors. 
Each of these genes is linked and encompassed by contig 3 
(see FIG. 1). 

Several lines of evidence suggested that this contig was of 
35 particular interest First, at least three of the four genes in 
this contig were within the deleted region identified in two 
FAP patients. (See Example 5 infra.) Second, allelic dele- 
tions of chromosome 5q21 in sporadic cancers appeared to 
be centered in this region. (Ashton-Rickardt et aL, 
40 Oncogene, in press; and MM et aL, Japn. J. Cancer Res., in 
press.) Some tumors exhibited loss of proximal RFLP mark- 
ers (up to and potentially including the 5' end of MCC), but 
no loss of markers distal to MCC Other tumors exhibited 
loss of markers distal to and perhaps including the 3' end of 
45 MCC, but no loss of sequences proximal to MCC. This 
suggested either that different ends of MCC were affected by 
loss in all such cases, or alternatively, that two genes (one 
proximal to and perhaps including MCC, the other distal to 
MCC) were separate targets of deletion. Third, clones from 
50 each of the six FAP region genes were used as probes on 
Southern blots containing tumor DNA from patients with 
Sporadic CRC Only two examples of somatic changes were 
observed in over 200 tumors studied: a rearrangement/ 
deletion whose centromeric end was located within the 
55 MCC gene (Kinzler et aL, supra) and an 800 bp insertion 
within the APC gene between nucleotides 4424 and 5584. 
Fourth, point mutations of MCC were observed in two 
tumors (Kinzler et aL) supra strongly suggesting mat MCC 
was a target of mutation in at least some sporadic colorectal 
60 cancers. 

Based on these results, we attempted to search for subtle 
alterations of contig 3 genes in patients with FAP. We chose 
to examine MCC and APC, rather than TB2 or SRP, because 
of the somatic mutations in MCC and APC noted above. To 
65 facilitate the identification of subtle alterations, the genomic 
sequences of MCC and APC exons were determined (see 
Table I, SEQ ID NO:24-38). 
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TABLE I 
APCEXONS 



EXON 

NUCLEOTIDES 1 EXON BOUNDARY SEQUENCE 2 



822 to 930 catgatimafct^tttacct atagtet^ - - - (SEQ ID NO: 24) 

. . . ACCAAG/gtaacagaagattreaaMmgg^ (SEQ ID NO: 25) 

931 to 1309 ggatattaaa^^aatffl^taaactcattta^acag/GTGGAA . . . (SEQ ID NO: 26) 

. . . AlXXl^g^gttctctatagtgtacatcgtagtgcatg (SEQ ID NO: 27) 
1310 to 1405 cateattgctctteaaataacaa agcattat^^ . . . (SEQ ID NO: 28) 

. . . AACTAG/gtaagacaaaaatgt^ (SEQ ID NO: 29) 

1406 to 1545 taiatgattgtetttttcctcttgccct^ . . . (SEQ ID NO: 30) 

. . . AACAAG/gtatgtttttataacatgtatt ft^ (SEQ ID NO: 31) 

1546 to 1623 gcttggcttcaagttgtotttttaatgatcct^^ . . . (SEQ ID NO: 32) 

. . . CAG^Gfgtactatttapflmraretpttt^^ (SEQ ID. NO: 33) 

1624 to 1740 gcaactamtgattttatgtataaattaafrtara . . . (SEQ ID NO: 34) 

. . . AAAAACygtaccmgaaaacatttagtactataatatgaatttcatgt (SEQ ^JJ 0 ^ 5 ) 

. . . CCACAQ^gtalatatagagttttatattectttt aaag^ (SEQ ID NO: 37) 

1956 to 8973 tcttgatttttatttcagA3CAAAT . . . (SEQ ID NO: 38 ) 

... GGTAITT/QXjCAAAAAAAAAltjl ri l lGTT (SEQ ID NO: 1) 

1 Relative to predicted translation initiation site 

2 Small case letters represent introos, large case letters represent exons 

The entire 3* end of the cloned APC cDNA (nt 1956-8973) appeared to be encoded in this exon, as indicated by 
restriction endonuclease mar/ping and sequencing of die cloned genomic DNA, The ORF ended at nt 8535. The 
extreme 3' end of the APC transcript has not yet been identified. 



These sequences were used to design primers for PGR 
analysis of constitutional DNA from FAP patients. 

We first amplified eight exons and surrounding introns of 
the MCC gene in affected individuals from 90 different FAP ^ 
kindreds. The PCR products were analyzed by a ribonu- 
clease (RNase) protein assay. In brief, the PCR products 
were hybridized to in vitro transcribed RNA probes repre- 
senting the normal genomic sequences. The hybrids were 
digested with RNase A, which can cleave at single base pair 
mismatches within DNA-RNA hybrids, and the cleavage 35 
products were visualized following denaturing gel electro- 
phoresis. Two separate RNase protection analyses were 
performed for each exon, one with the sense and one with 
the antisense strand. Under these conditions, approximately 
40% of all mismatches are detectable. Although some amino 40 
acid variants of MCC were observed in FAP patients, all 
such variants were found in a small percentage of normal 
individuals. These variants were thus unlikely to be respon- 
sible for the inheritance of FAP. 

CWe next examined three exons of the A PC gene. The 45 
three exons examined included those containing nt 822-930, 
931-1309, and the first 300 nt of the most distal exon (nt 
1956-2256). PCR and RNase protection analysis were per- 
formed as described in Kinzler et aL supra, using the primers 
underlined in Table I (SEQ ID NO:24-38). The primers for 50 
nt 1956-2256 were 5-GCAAATCCTAAGAGAGAACAA- 
3* (SEQ ID NO:99) and S'-GATGGCAAGCITGAGCCAG- 
3'iSEQ ID NO:100)7l 

f Un 90 kindreds, thdl<Nase protection method was used to 
screen for mutations and in an additional 13 kindreds, the 55 
PCR products were cloned and sequenced to search fee 
mutations not detectable by RNase protection. PCR products 
were cloned into a Bluescript vector modified as described 
in T. A. Holton and M. W. Graham, Nucleic Acids Res. 19, 
1156 (1991). A minimum 0 f 100 clones were pooled and €0 
sequenced. Five variants were detected among the 103 
kindreds analyzed. Cloning and subsequent DNA sequenc- 
ing of the PCR product of patient P21 indicated a C to T 
transition in codon 413 that resulted in a change from 
arginine to cysteine. This amino acid variant was not 65 
observed in any of 200 DNA samples from individuals 
without FAP. Cloning and sequencing of me PCR product 



18 



from patients P24 and P34, who demonstrated the same 
abnormal RNase protection pattern indicated that both had a 
C to T transition at codon 801 that resulted in a change from 
30 argmme(CGA)toastopcodon^ 

present in 200 individuals without FAR As this point muta- 
tion resulted in the predicted loss of the recognition site for 
the enzyme Taq I, appropriate PCR products could be 
digested with Taq I to detect the mutation. This allowed us 
35 to determine that the stop codon co-segragated with disease 
phenotype in members of the family of P24. The inheritance 
of this change in affected members of the pedigree provide s 
additional evidence for the importance of the mutation^] 
Cloning and sequencing of the PCR product from FAP 
40 patient P93 indicated a C to G transversion at codon 279, 
also resulting in a stop codon (change from TCA to TGA). 
This mutation was not present in 200 individuals without 
FAP. Finally, one additional mutation resulting in a serine 
45 (TCA) to stop codon (TGA) at codon 712 was detected in a 
single patient with FAP (patient P60). 

The five gerrnline mutations identified are summarized in 
Table HA, as well as four others discussed in Example 9, 



50 



TABLE LA 



rw™W mutations of the APT. «ra in FAP and GS Patients 



EXTRA- 
COLO- 

S5 NIC NUCLEO- 
PATTENT TIDE 
DISEASE CODON CHANGE 



AMINO 
ACID 

CHANGE AGE 



60 



93 

Osteoma 

24 
34 



Tumor 
65 21 



279 TCA->TGA 



301 CGA->TGA 
301 CGA->TOA 



413 CGC->TGC 



Ser->Stop 



Arg->Stop 
Arg->Stop 



Arg->Cys 



39 Mandi- 
bular 



46 None 
27 Des- 
moid 



24 Mandi- 
bular 
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TABLE UA-continued 



Germlme mutations of the APC gene in FAP and GS Patients 

5 

EXTRA- 
COLO- 

NIC NUCXEO- AMINO 

PATIENT TIDE ACID 

DISEASE CODO N CHANGE CHANGE AGE 

10 



60 712 TCA->TGA Ser->Stop 37 Mandi- 

bular 



3746 243 CAGAG->CAG splice- 15 

junction 

3460 301 CGA-> TG A Arg->Stop 
3827 456 CTTTCA->CTTCA frameshift 
3712 500 T->G Tyr->Stop 

* The mutated nucleotides are underlined. 20 

fin addition to these gennline mutations, we identified sev- 
jUOL^ o-al somatic mutations of MCC and APC in sporadic CRCs. 

Seventeen MCC exons were examined in 90 sporadic col- 
orectal cancers by RNase protection analysis. In each case 
where an abnormal RNase protection pattern was observed, 25 
the corresponding PCR products were cloned and 
sequenced. This led to the identification of six point muta- 
tions (two described previously) (Kinder et al, supra), each 
of which was not found in the gennline of these patients 
(Table rJB>J 30 

TABLE KB 



Somatic Mutations in Sporadic CRC Patients 









NUCLEOTIDE 




PATIENT 


CODON 1 


CHANGE 




T35 


MCC 12 


GAG/gtaaga-> 








GAG/gtaaaa 




T16 


MCC 145 


ctcag/OGA-> 








atcag/GGA 




T47 


MCC 267 


CX3G->CTG 




TBI 


MCC 490 


TCG->TTG 




T35 


MCC 506 


CGG->CAG 




191 


MCC 698 


GCToGTT 




T34 


APC 288 


ccaotScccagccagt 




T27 


APC 331 


CGA->TGA 




T135 


APC 437 


CAA/gtaa->CAA/gcaa 




T2CI 


APC 1338 


CAG->TAG 



AMINO 35 

ACID 

CHANGE 



(Spike 

Donor) 

(Splice 

Acceptor) 

Arg->Leu 

Ser->Leu 

Arg->Gm 

Ala->Val 

(Insertion) 

Arg->Stop 

(Splice 

Donor) 

Gm->Stop 



40 



45 



50 



Pot gpiioe site mutations, the codon nearest to the mutation is listed 
The underlined nucleotides were mutant; small case letters represent introns, 
large case letters represent exons 

Four of the mutations resulted in amino acid substitutions 
and two resulted in the alteration of splice site consensus 
elements. Mutations at analogous splice site positions in 
other genes have been shown to alter RNA processing in 55 
vivo and in vitro. 

Three exons of APC were also evaluated in sporadic 
tumors. Sixty tumors were screened by RNase protection, 
and an additional 98 tumors were evaluated by sequencing. 
The exons examined included nt 822-930, 931-1309, and 60 
1406-1545 (Table I). A total of three mutations were 
identified, each of which proved to be somatic. Tumor T27 
contained a somatic mutation of CGA (arginine) to TGA 
(stop codon) at codon 33. TumorT135 contained a GT to GC 
change at a splice donor site. Tumor T34 contained a 5 bp 65 
insertion (CAGCC between codons 288 and 289) resulting 
in a stop at codon 291 due to a frameshift 
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We serendipitously discovered one additional somatic 
mutation in a colorectal cancer. During our attempt to define 
the sequences and splice patterns of the MCC and APC gene 
products in colorectal epithelial cells, we cloned cDNAfrom 

5 the colorectal cancer cell line SW480. The amino acid 
sequence of the MCC gene from SW480 was identical to 
that previously found in clones from human brain. The 
sequence of APC in SW480 cells, however, differed 
significantly, in that a transition at codon 1338 resulted in a 

10 change from glutamine (CAG) to a stop codon (TAG). To 
determine if this mutation was somatic, we recovered DNA 
from archival paraffin blocks of the original surgical speci- 
men (T201) from which the tumor cell line was derived 28 
years ago. 

15 DNA was purified from paraffin sections as described in 
S. E. Goelz, S. R. Hamilton, and B. Vogelstein. Biochem. 
Biophys. Res. Comm. 130, 118 (1985). PCR was performed 
as described in reference 24, using the primers 
S'-GTTCCAGCAGTGTCACAG-S' (SEQ ID NO: 101) and 

20 5 , <KK}AG/aT^CGC^<XTGA-3 , (SEQ ID NO:102). A 
PCR product containing codon 1338 was amplified from the 
archival DNA and used to show that the stop codon repre- 
sented a somatic mutation present in the original primary 
tumor and in cell lines derived from the primary and 

25 metastatic tumor sites, but not from normal tissue of the 
patient 

The ten point mutations in the MCC and APC genes so far 
discovered in sporadic CRCs are summarized in Table IIB. 
Analysis of the number of mutant and wild-type PCR clones 

30 obtained from each of these tumors showed that in eight of 
the ten cases, the wild-type sequence was present in approxi- 
mately equal proportions to the mutant This was confirmed 
by RFLP analysis using flanking markers from chromosome 
5q which demonstrated that only two of the ten tumors 

35 (T135 and T201) exhibited an allelic deletion on chromo- 
some 5q. These results are consistent with previous obser- 
vations showing that 20-40% of sporadic colorectal tumors 
had allelic deletions of chromosome 5q. Moreover, these 
data suggest that mutations of 5q21 genes are not limited to 

40 those colorectal tumors which contain allelic deletions of 
this chromosome. 

EXAMPLE 4 

This example characterizes small, nested deletions in 

45 DNA from two unrelated FAP patients. 

DNA from 40 FAP patients was screened with cosmids 
that has been mapped into a region near the APC locus to 
identify small deletions or rearrangements. Two of these 

50 cosmids, L5.71 =nd L5.79, hybridized with a 1200 kb NotI 
fragment in DNAs from most of the FAP patients screened. 

The DNA of one FAPpatient, 3214, showed only a 940 kb 
NotI fragment instead of the expected 1200 kb fragment 
DNA was analyzed from four other members of the patient's 

55 immediate family; the 940 kb fragment was present in her 
affected mother (4711), but not in the other, unaffected 
family members. The mother also carried a normal 1200 kb 
Not! fragment that was transmitted to her two unaffected 
offspring. These observations indicated that the mutant 

60 polyposis allele is on the same chromosome as the 940 kb 
NotI fragment A simple interpretation is mat APC patients 
3214 and 4711 each carry a 260 kb deletion within the APC 
locus. 

If a deletion were present, then other enzymes might also 
65 be expected to produce fragments with altered mobilities. 
Hybridization of L5.79 to Nrul-digested DNAs from both 
affected members of the family revealed a novel Nrul 
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fragment of 1300 kb, in addition to the normal 1200 kb Nrul 
fragment Furthermore, Mlul fragments in patients 3214 and 
4711 also showed an increase in size consistent with the 
deletion of an Mlul site. The two chromosome 5 homologs 
of patient 3214 were segregated in somatic cell hybrid lines; 5 
HHW1155 (deletion hybrid) carried the abnormal homolog 
and HHW1159 (normal hybrid) carried the normal homolog. 
/ fBecause patient 8214 showed bray a 940 Id) NotI 
Jhrt ^fragment, she had not inherited the 1200 kb fragment present 
9 in the unaffected father's DNA. This observation suggests 10 

that he must be heterozygous for, and have transmitted, 
either a deletion of the L5.79 probe region or a variant NotI 
fragment too large to resolve on the gel system. As expected, 
the hybrid cell line HHW1159, which carries the paternal 
homolog, revealed no resolved Not fragment when probed 
with L5.79. However, probing of HHW1159 DNA with 
L5.79 following digestion with other enzymes did reveal 
restriction fragments, demonstrating the presence of DNA 
homologous to the probe. The father is, therefore, inter- 
preted as heterozygous for a polymorphism at the NotI site, 
with one chromosome 5 having a 1200 kb NotI fragment and 20 
the other having a fragment too large to resolve consistently 
on the geL The latter was transmitted to patient 3214J 
ft [When double digests were used to order restriction sites 
(L^ A 1 within the 1200 kb NotI fragment, L5.71 and L5.79 were 

beth found to He on a 550 kb Notl-Nrul fragment and, 25 
therefore, on the same side of an Nrul site in the 1200 kb 
NotI fragment To obtain genomic representation of 
sequences present over the entire 1200 kb Nod fragment, we 
constructed a library of small-fragment inserts enriched for 
sequences from mis fragment DNA from the somatic cell 30 
hybrid HHW141, which contains about 40% of chromosome 
5 , was digested with NotI and electrophoresed under pulsed- 
field gel (PFG) conditions; EcoRI fragments from the 1200 
ldb region of mis gel were cloned into a phage vector. Probe 
Map30 was isolated from this library. In normal individuals 35 
probe Map30 hybridizes to the 1200 kb NotI fragment and 
to a 200 Id) Nrul fragment This latter hybridization places 
Map30 distal, with respect to the locations of L5.71 and 
L5.79, to the Nrul site of the 550 kb Notl-Nrul fragment) 

Because Map30 hybridized to the abnormal, 1300 kb Nrul 40 
fragment of patient 32 14, the locus defined by Map30 lies 
outside the hypothesized deletion. Furthermore, in normal 
chromosomes Map30 identified a 200 kb Nrul fragment and 
L5.79 identified a 1200 Kb Nrul fragment; the hypothesized 
deletion must, therefore, be removing an Nrul site, or sites, 45 
lying between Map30 and L5.79, and these two probes must 
flank the hypothesized deletion. A restriction map of the 
genomic region, showing placement of these probes, is 
shown in FIG. 5. 

A NotI digest of DNA from another FAP patient, 3824, 50 
was probed with L5.79. In addition to the 1200 kb normal 
NotI fragment, a fragment of approximately 1100 kb was 
observed, consistent with the presence of a 100 kb deletion 
in one chromosome 5. In this case, however, digestion with 
Nrul and Mlul did not reveal abnormal bands, indicating that 55 
if a deletion were present, its boundaries must lie distal to the 
Nrul and Mlul sites of the fragments identified by L5.79. 
Consistent with this expectation, hybridization of Map30 to 
DNA from patient 3824 identified a 760 kb Mlul fragment 
inaddition to the expected 860 Kb fragment, supporting the 60 
interpretation of a 100 Id) deletion in this patient The two 
chromosome 5 homologs of patient 3824 were segregated in 
somatic cell hybrid lines; HHW1291 was found to carry 
only the abnormal homolog and HHW1290 only the normal 
homolog. . 65 

That the 860 kb Mlul fragment identified by Map30 is 
distinct from the 830 kb Mlul fragment identified previously 
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by L5.79 was demonstrated by hybridization of Map30 and 
L5.79 to a Notl-Mlul double digest of DNA from the hybrid 
cell (HHW1159) containing the nondeleted chromosome 5 
homolog of patient 3214. As previously indicated, this 

5 hybrid is interpreted as missing one of the NotI sites that 
define the 1200 kb fragment A 620 kb Notl-Mlul fragment 
was seen with probe L5.79, and an 860 kb fragment was 
seen with Map30. Therefore, the 830 kb MM fragment 
recognized by probe L5.79 must contain a NotI site in 

10 HHW1159 DNA; because the 860 kb Mlul fragment 
remains intact, it does not carry this NotI site and must be 
distinct from the 830 kb Mlul fragment 

EXAMPLE 5 

15 This example demonstrates the isolation of human 
sequences which span the region deleted in the two unre- 
lated FAP patients characterized in Example 4. 

A strong prediction of the hypothesis that patients 8214 
and 3824 carry deletions is that some sequences present on 

20 normal chromosome 5 homologs would be missing from the 
hypothesized deletion homologs. Therefore, to develop 
genomic probes mat might confirm the deletions, as well as 
to identify genes from the region, YAC clones from a contig 
seeded by cosmid L5.79 were localized from a library 

25 containing seven haploid human genome equivalents 
(Albertsen et aL, Proc. Natl Acad. Sci. U.S.A., VoL 87, pp. 
4256-4260 (1990)) with respect to the hypothesized dele- 
tions. Three clones, YACs 57B8, 310D8, and 183H12, were 
found to overlap the deleted region. 

Importantly, one end of YAC 57B8 (clone XT57) was 
found to lie within the patient 3214 deletion. Inverse poly- 
merase chain reaction (PCR) defined the end sequences of 
the insert of YAC 57B8. PCR primers based on one of these 

35 end sequences repeatedly failed to amplify DNA from the 
somatic cell hybrid (HHW1155) carrying the deleted 
homolog of patient 3214, but did amplify a product of the 
expected size from the somatic cell hybrid (HHW1159) 
carrying the normal chromosome 5 homolog. This result 

^ supported the interpretation that the abnormal restriction 
fragments found in the DNA of patient 3214 result from a 
deletion. 

Additional support for the hypothesis of deletion in DNA 
from patient 3214 came from subcloned fragments of YAC 

4 5 1 83H 12, which spans the region in question. Yl 1, an EcoRI 
fragment cloned from YAC 183H12, hybridized to the 
normal, 1200 kb Nod fragment of patient 4711, but failed to 
hybridize to the abnormal, 940 kb NotI fragment of 4711 or 
to DNA from deletion cell line HHW1155. This result 

50 confirmed the deletion in patient 3214. 

Two additional EcoRI fragments from YAC 183H12, Y10 
and Y14, were localized within the patient 3214 deletion by 
their failure to hybridize to DNA from HHW1155. Probe 
Y10 hybridizes to a 150 kb Nrul fragment in normal 

55 chromosome 5 homologs. Because the 3214 deletion creates 
the 1300 kb Nrul fragment seen with the probes L5.79 and 
Map30 that flank the deletion, these Nrul sites and the 150 
kb Nrul fragment lying between must be deleted in patient 
32 14. Furthermore, probe Y10 hybridizes to the same 620 kb 

€0 Notl-Mlul fragment seen with probe L5.79 in normal DNA, 
indicating its location as L5.79-proximal to the deleted Mlul 
site and placing it between the Mlul site and the L5.79- 
proximal Nrul site. The Mlul site must, therefore, lie 
between the Nrul sites that define the 150 kb Nrul fragment 

65 (see FIG. 5). 

Probe Yll also hybridized to the 150 kb Nrul fragment in 
the normal chromosome 5 homolog, but failed to hybridize 
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to the 620 kb Notl-Mlul fragment, placing it L5.79-distal to 
the Mlul site, but proximal to the second Nrul site. Hybrid- 
ization to the same (860 Kb) Mlul fragment as Map30 
confirmed the localization of probe YU L5.79-distal to the 
Mlul site. 5 

Probe Y14 was shown to be L5.79-distal to both deleted 
Nrul sites by virtue of its hybridization to the same 200 kb 
Nrul fragment of the normal chromosome 5 seen with 
Map30. Therefore, the order of these EcoRI fragments 
derived from YAC 183H12 and deleted in patient 3214, with w 
respect to L5.79 and Map30, is L5.79-Y10-Y11-Y14- 
Map30. 

The 100 kb deletion of patient 3824 was confirmed by me 
failure of aberrant restriction fragments in this DNA to ^ 
hybridize with probe Yll, combined with positive hybrid- 
izations to probes Y10 and/or Y14. Y10 and Y14 each 
hybridized to the 1100 Kb NotI fragment of patient 3824 as 
well as to the normal 1200 Kb Nod fragment, but Yll 
hybridized to the 1200 kb fragment only. In the Mluldigest, 
probe Y 14 hybridized to the 860 Kb and 760 kb fragments 
of patient 3824 DNA, but probe Yll hybridized only to the 
860 kl3 fragment We conclude that the basis for the 
alteration in fragment size in DNA from patient 3824 is, 
indeed, a deletion. Furthermore, because probes Y10 and ^ 
Y14 are missing from the deleted 3214 chromosome, but 
present on the deleted 3824 chromosome, and they have 
been shown to flank probe Yll, the deletion in patient 3824 
must be nested within the patient 3214 deletion. 

Probes Y10, Yll, Y14 and Map30 each hybridized to ^ 
YAC 310D8, indicating that mis YAC spanned the patient 
3824 deletion and at a minimum, most of the 3214 deletion. 
Hie YAC characterizations, therefore, confirmed the pres- 
ence of deletions in the patients and provided physical 
representation of the deleted region. 35 

EXAMPLE 6 

This example demonstrates mat the MCC coding 
sequence maps outside of the region deleted in the two FAP 
patients characterized in Example 4. 40 

An intriguing FAP candidate gene, MCC, recently was 
ascertained with cosmid L5.71 and was shown to have 
undergone mutation in colon carcinomas (Kinzler et at, 
supra). It was therefore of interest to map this gene with 
respect to the deletions in APC patients. Hybridization of 45 
MCC probes with an overlapping series of YAC clones 
extending in either direction from L5.71 showed that the 3 
end of MCC must be oriented toward the region of me two 
APC deletions. 

Therefore, two 3* cDNA clones from MCC were mapped so 
with respect, to the deletions: clone la (bp 2378-4181) and 
done 7 (bp 2890-3560). Clone Id contains sequences from 
the C-terminal end of the open reading frame, which stops 
at nucleotide 2708, as well as 3' untranslated sequence, 
done 7 contains sequence that is entirely 3* to the open 55 
reading frame. Importantly, the entire 3* untranslated 
sequence contained In the cDNA clones consists of a single 
2.5 kb exon. These two clones were hybridized to DNAs 
from the YACs spanning the FAP region. Clone 7 fails to 
hybridize to YAC 310D8, although it does hybridize to «o 
YACs 183H12 and 57B8; the same result was obtained with 
the cDNA 1CL Furthermore, these probes did show hybrid- 
ization to DNAs from both hybrid cell lines (HWW1159 and 
HWW1155) and the lymphoblastoid cell line from patient 
3214, cortorimgmefr location 65 
Additional mapping experiments suggested that the 3' end of 
the MCC cDNA clone contig is likely to be located more 
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than 45 kb from the deletion of patient 3214 and, therefore, 
more than 100 kb from the deletion of patient 3824. 



EXAMPLE 7 



This example identifies three genes within the deleted 
region of chromosome 5 in the two unrelated FAP patients 
characterized in Example 4. 
Genomic clones were used to screen cDNA libraries in 

10 three separate experiments. One screening was done with a 
phage clone derived from YAC 310D8 known to span the 
260 kb deletion of patient 3214. A large-insert phage library 
was constructed from this YAC; screening with Yll iden- 
tified X205, which mapped within both deletions. When 

15 clone X205 was used to probe a random-, plus oligo(dT)-, 
primed fetal brain cDNA library (approximately 300,000 
phage), six cDNA clones were isolated and each of them 
mapped entirely within both deletions. Sequence analysis of 
these six clones formed a single cDNA contig, but did not 

2Q reveal an extended open reading frame. One of the six 
cDNAs was used to isolate more cDNA clones, some of 
which crossed the L5.71-proximal breakpoint of the 3824 
deletion, as indicated by hybridization to both chromosome 
of mis patient These clones also contained an open reading 

23 frame, indicating a transcriptional orientation proximal to 
distal with respect to L5.71. This gene was named DPI 
(deleted in polyposis 1). This gene is identical to TB2 
described above. 
cDNA walks yielded a cDNA contig of 3.0-3.5 kb, and 

30 included two clones containing terminal poly(A) sequences. 
This size corresponds to the 3.5 kb band seen by Northern 
analysis. Sequencing of the first3163 bp of the cDNA contig 
revealed an open reading frame extending from the first base 
to nucleotide 631, followed by a 2.5 kb 3* untranslated 

35 region. The sequence surrounding the methionine codon at 
base 77 conforms to the Kozak consensus of an initiation 
methionine (Kozak, 1984). Failed attempts to walk farmer, 
coupled with the similarity of the lengths of isolated cDNA 
and mRNA, suggested that the rTO 2 -terminus of the DPI 

40 protein had been reached. Hybridization to a combination of 
genomic and YAC DNAs cut with various enzymes indi- 
cated the genomic coverage of DPI to be approximately 30 
kb. 

& Ejyo additional probes for the locus, YS-11 and YS-39, 

45 which had been ascertained by screening of a cDNA library 
with an independent YAC probe identified with MCC 
sequences adjacent to L5.71, were mapped into the deletion 
region. YS-39 was shown to be a cDNA identical in 
sequence to DPI. Partial characterization of YS-11 had 

50 shown that 200 bp of DNA sequence at one end was 
identical to sequence coding for the 19 kd protein of the 
ribosomal signal recognition particle, SRP19 (Lingelbach et 
aL, supra). Hybridization experiments mapped YS-11 within 
beth deletions. The sequence of this clone, however, was 

55 found to be complex. Although 454 bp of the 1032 bp 
sequence of YS-11 were identical to the GenBank entry for 
the SRP19 gene, another 578 bp appended 5' to the SRP19 
sequence was found to consist of previously unreported 
sequence containing no extended open reading frames. This 

60 suggested that YS-1 1 was either a chimeric clone containing 
two independent inserts or a clone of an incompletely 
processed or aberrant message. If YS-11 were a conven- 
tional chimeric clone, the independent segments would not 
be expected to map to the same physical region. The 

65 segments resulting from anomalous processing of a continu- 
ous transcript, however, would map to a single chromosomal 
region J 
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Inverse PCR with primers specific to the two ends of 
YS-11, the SRP19 ,end and the unidentified region, verified 
that both sequences map within the YAC 310D8; therefore, 
YS-11 is most likely a clone of an immature or anomalous 
mRNA species. Subsequently, both ends were shown to lie 5 
with the deleted region of patient 3824, and YS-11 was used 
to screen for additional cDNA clones. 

Of the 14 cDNA clones selected from the fetal brain 
library, one clone, V5, was of particular interest in that it 
contained an open reading frame throughout, although it 10 
included only a short identity to the first 78 5* bases of the 
YS-11 sequence. Following the 78 bp of identical sequence, 
the two cDNA sequences diverged at an AG. Furthermore, 
divergence from genomic sequence was also seen after these 
78 bp, suggesting the presence of a splice junction, and 15 
supporting the view that YS-11 represents an irregular 
message. 

Starting with V5, successive 5' and 3* walks were per- 
formed; the resulting cDNA contig consisted of more than 
100 clones, which defined a new transcript, DP2. Clones 
walking in the 5' direction crossed the 3824 deletion break- 
point farthest from L5.71; since its 3* end is closer to this 
cosmid than its 5' end, the transcriptional orientation of DP2 
is opposite to that of MCC and DPI. 

The third screening approach relied on hybridization with 
a 120 kb MM fragment from YAC 57B8. This fragment 
hybridizes with probe Yll and completely spans the 100 kb 
deletion in patient 3824. the fragment was purified on two 
preparative FFGs, labeled, and used to screen a fetal brain ^ 
cDNA library. A number of cDNA clones previously iden- 
tified in the development of the DPI and DP2 contigs were 
reascertained. However, 19 new cDNA clones mapped into 
the patient 3824 deletion. Analysis indicated that these 19 
formed a new contig, DP3, containing a large open reading 35 
frame. 

A clone from the 5* end of this new cDNA contig 
hybridized to the same EcoRI fragment as the 3* end of DPI 
Subsequently, the DP2 and DP3 contigs were connected by 
a single 5* walking step from DP3, to form me single contig 40 
DP2.5. The complete nucleotide sequence of DP2.5 is 
shown in FIG. 9. 
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The consensus cDNA sequence of DP2.5 suggests that the 
entire coding sequence of DP2.5 has been obtained and is 
8532 bp long. The most 5* ATG codon occurs two codons 
from an in-frame stop and conforms to the Kozak initiation 

5 consensus (Kozak, NucL Acids. Res., VoL 12, p. 857-872 
1984). The 3' open reading frame breaks down over the final 
1.8 kb, giving multiple stops in all frames. A poly(A) 
sequence was found in one clone approximately 1 kb into the 
3* untranslated region, associated with a polyadenylation 

10 signal 33 bp upstream (position 9530). The open reading 
frame is almost identical to that identified as APC above. 

An alternatively spliced exon at nucleotide 934 of the 
DP2.5 transcript is of potential interest it was first discov- 
ered by noting that two classes of cDNA had been isolated. 

15 The more abundant cDNA class contains a 303 bp exon not 
included in the other. The presence in vivo of the two 
transcripts was verified by an exon connection experiment 
Primers flanking the alternatively spliced exon were used to 
amplify, by PCR, cDNA prepared from various adult tissues. 

20 Two PCR products that differed in size by approximately 
300 bases were amplified from all the tissues tested; the 
larger product was always more abundant than the smaller. 
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This example demonstrates the primers used to identify 
subtle mutations in DPI, SRP19, and DP25. 
c&f /5b obtain DNA sequence adjacent to the exons of the 
^ffn genes D pi, DP2.5, and SRP19, sequencing substrate was 
<T* ~ i obtained by inverse PCR amplification of DNAs from two 
O * YACs, 3 10D8 and 183H12, that span the deletions. ligation 
at low concentration cyclized the restriction enzyme- 
digested YAC DNAs. Oligonucleotides with sequencing 
tails, designed in inverse orientation at intervals along the 
35 cDNAs, primed PCR amplification from the cyclized tem- 
plates. Comparison of these DNA sequences with the cDNA 
sequences placed exon boundaries at the divergence points. 
SRP19 and DPI were each shown to have five exons. DP2.5 
consisted of 15 exons. The sequences of me oligonucleotides 
40 synthesized to provide PCR amplification primers for the 
exons of each of these genes are listed in Table m SEQ ID 
NO:39-947J 



7. 



TABLE UI 



Sequences of Primers Used fo r SSCP Analyses 
Exon Primer 1 Primer 2 



DPI 

UP-IXXXXGCCTOCOGCr^ RP-GCAGCGGCGGCT0O3GTC 
VP-GTQAACGGCICTCXrGCTGC RP-ACCftQOGQGGAQQAATGGA 
UP-ATOADS3XnTACCAAATGATA3AC RP-TEATTCCTA Cllci iCE AJACAQ 
rjpjTACXXAlGCIGGCTCTITiTC RP-TCOGGCCATUiioiiOCKlA 
ur RP-ATCAAGCTCCACnAAGAAGGTA 

SRP19 



UP-ACATIAGGCACAAAGCTiaCAA 



UP-TGOGGCTOCTXXXnTGTTG RP-GCCXX7ITOCnTCTaA^aA^ 

UP-TmCICCKXX^^ ^-ATGACACXCOCC^TO^ 

UP^CCACTIAAAGCACAlVaATriAOT RP^TrxrGGAAAATAGTOAAGAACC 

UP-TTCnAAGIC ClUlll 1 H G RP-TmOAAOJlTlTllUl^lTOrG 

UP^riCAGATTAI^CACnAAGCCIAAC RP-CAIXJIUICITACAGTAGTAOCA 

DP15 

UP-AGGTCCAAGGGTAGCCAAGG* RP-T\AAAATOGAT^AACC^AATrrAAAAG 

UP-AAAlACAGAAICAiartrriGAAGrr RP-ACACXTIAAAGATGACAATTTO 

UP-TAACTIAG ATAGCAGTAATTrcCC * RP-AGAA3AAACTGGAOTACACAAGG 

lJP-AIAGGTCATIt3CTTXTITGCIX3AT* RP-TGAATTTTAATOGATIACCrAGCSrr 

UP^jTTTTnTGCTTTIACTGATTAACG RP-TtJTAATI CATTT IATIYXXAATACCTC 

UP43GTA^GCCATAGTA3X3ATrAriTCT RP-CTACCTATTITIAmCCCACAAAC 



• # 
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TABLE M-continued 



Sequences of Primers Used for SSCP Analyses 



Exan Primer 1 



Primer 2 



3-A 
-B 
-C 
-D 
^ 
-F 
J3 
-H 
-I 
-J 
-K 
-L 
-N 
-M 
-O 
-P 

■Q 

-R 

-S 
-T 
-U 
-V 

-w 



UP-AAGAAAGCCTACACCAITnTGC 
UP-ACCTAmGTCTAAAriAIACCATC 
UP-AGTCGTAAITITGTTTCTAAACrc 
UP-TCAITCACTCACAGCCTGATGAC* 
UP-AAACAICAriXXTCITC AAAIA AC 
UP^AIGATIXnCITITIXXTCTITGC 
UP-TITIAAAIGAICCTCTmcrGriAr 
UP-TITCTAlTCrriACrOCTAGCArr 
UP-TAG/JGACCCAEAITCTCriTC 



UP-GTTACTGCATACACArTGTGAC 



UP-AGTACAAGGATGCCAAEATTATG* 
UP- ATITOAAIACIACAGTGrnACCC * 
UP^IGCCCAIACACArrcAAACAC* 
UP-AGIXHTAAAXATTCAGAIGAGCAG* 
UP-AAGCCTACCAAriA3AGTGAACG* 
UP-AAGAAACAAIACAGACTTAITGTG* 
UPAICrCCCTCa^AAAGTGGrGC* 



UP-AGTAAA3XXTTGCAGTICAGAGG* 
XJP-CCCAGACTGCTTC AAAATTACC * 
UP-CCCTOCAAATGAGTEAGCTGC* 
UP-ACCCAACAAAAATCAGTTAGAIG* 
UP-AIGArGTTQACXTXTOCAGGG* 
UP-AAAGACAEACCAGACAGAGGG* 
UP-AAGAIGACCIGTTGCAGGAATG* 
UP-CAATAGTAAGIAGTnACAICAAG* 
UP-CAGCCCCnCAAGCAAACATC* 
UPO^GTCrcCTGGCCGAAACrc* 



UP-TGGTAAIGGAGOCAAIAAAAAGG* 
UP-TOTCTCEAIOCACACA rrOCJI C' 
UP-GG AG AAG AACIXKJ AAGTTCATC * 
W-TCIXXCACAGGTAAIACTCCC 



UP-CAGGACAAAAIAAICCrGrrCCC 



RP-GATCArrcnAGAACCATCITGC 

RP^nCATCGCATIACTGACCAG 

RP-TGAAGGACTCCGATTTC ACCC * 

RP<3CTTrGAAACATGCACIACGAr 

RP-TACCATGAITTAAAAArCCACCAG 

RP^riGAGCTATCITAAGAAArACATG 

RP-ACAGAGTCAGAOCCTCCCTCAAAG 

RP-AIACACAGGTAAGAAATTAGGA 

RP^AAITAGGTCTTTTIGAGAGTA 

RP-GCTTITTGTITCXjTAACArGAAG* 

IO»-ACTrCTXnrrnTICAGAACGAG* 

RP<rnxnATlCIAAITrGG CAIAA .GG* 

RP>TG11'1UCGTCTIX3CCC ATCTT* 

RP-AGCTGATGACAAAGArGAEAATC* 

RP-AIG AGTGGGGTCTCCTG AAC ♦ 

RP-TCCATCTGGAGTACriTCrrGTG* 

RP-CCGTGGCAIATCAIXXaX* 

RP43AOCCTCATCI GTACT ICTOC* 

RP-TTGTCGTAlAGGnTTACTGGTG* 

RP47IGGCIGGTAACITIAGCCrC» 

RP-AITXHOTAACTITICArCAGriGC* 

RP-CT1TITR3GCAITG0GGAGCT* 

RP43AATCAGACCAAGCITGrrCTAGAr* 

RP-AAACAGGACTTGTACTGTAGGA* 

RP^AGGACTTAITCCATITCTACC * 

RP^nTGACTGGCGTACTAAJACAG* 

RP-TGGGACTTTTCGCCAICCAC ♦ 

RP-AItnTITK^IXXnCACTriTI^ 

RP-TIGAAICTITAAIGITIGGAnTGC* 

RP-GCTACAACTGAATOGGGIACG 

I^-AITITCriTACnTICAITCTrCCTC 



read in the 5* to 3" (Erection, me first primer in each pair lies 5* of the exon it amplifies: 

;j* ^.1 U*nt!£ul V» art *et«i 



second primer lies 3' of the exon it amplifies. Primers mat lie wimm the exon are identified by 
UP represents me -21M13 universal pnmer sequence: 
RP represents the M13 reverse primer sequence?! 



With the exception of exons 1, 3, 4, 9, and 15 of DP2.5 (see 
below), the primer sequences were located in intron 
sequences flanking the exons. The 5* primer of exon 1 is 40 
complementary to the cDNA sequence, but extends just into 
the 5* Kozak consensus sequence for the initiator 
methionine, allowing a survey of the translated sequences. 
The 5* primer of exon 3 is actually in the 5' coding sequences 
of this exon, as three separate intronic primers simply would 45 
not amplify. The 5* primer of exon 4 just overlaps the 5' end 
of this exon, and we thus fail to survey the 19 most 5' bases 
of this exon. For exon 9, two overlapping primer sets were 
used, such that each had one end within the exon. For exon 
15, the large 3' exon of DP2.5, overlapping primer pairs 50 
were placed along the length of the exon; each pair amplified 
a product of 250-400 bases. 



This example demonstrates the use of single stranded 55 
conformation polymorphism (SSCP) analysis as described 
by Orita et aL Proc. NatL Acad. ScL U.S A., Vol. 86, pp. 
2766-70 (1989) and Genomics, Vol 5, pp. 874-879 (1989) 
as applied to DPI, SRP19 and DP2 J. 

SSCP analysis identifies most single- or multiple-base 60 
changes in DNA fragments up to 400 bases in length. 
Sequence alterations are detected as shifts in electrophoretic 
mobility of single-stranded DNA on nondenaturing acryla- 
mide gels; the two complementary strands of a DNA seg- 
ment usually resolve as two SSCP conformers of distinct 65 
mobilities. However, if the sample is from an individual 
heterozygous for a base-pair variant within the amplified 




EXAMPLE 9 



segment, often mice or more bands are seen. In some cases, 
even the sample from a homozygous individual will show 
4Q multiple bands. Base-pair-change variants are identified by 
differences in pattern among the DNAs of the sample set 
Exons of the candidate genes were amplified by PCR 
from the DNAs of 61 unrelated FAP patients and a control 
set of 12 normal individuals. The five exons from DPI 
revealed no unique conformers in the FAP patients, although 
45 common conformers were observed with exons 2 and 3 in 
some individuals of both affected and control sets, indicating 
the presence of DNA sequence polymorphisms. Likewise, 
none of the five exons of SRP19 revealed unique conformers 
in DNA from FAP patients in the test panel. 
50 Testing of exons 1 through 14 and primer sets A through 
N of exon 15, of the DP2.5 gene, however, revealed variant 
conformers specific to FAP patients in exons 7, 8, 10, 11, and 
15. These variants were in the unrelated patients 3746, 3460, 
3827, 3712, and 3751, respectively. The PCR-SSCP proce- 
55 dure was repeated for each of these exons in the five affected 
individuals and in an expanded set of 48 normal controls. 
The variant bands were reproducible in the FAP patients but 
were not observed in any of the control DNA samples. 
Additional variant conformers in exons 11 and 15 of the 
DP2.5 gene were seen; however, each of these was found in 
60 both the affected and control DNA sets. The five sets of 
conformers unique to the FAP patients were sequenced to 
determine the nucleotide changes responsible for their 
altered mobilities. The normal conformers from the host 
individuals were sequenced also. Bands were cut from the 
65 dried acrylamide gels, and the DNA was eluted. PCR 
amplification of these DNAs provided template for sequenc- 
ing. 
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D 29 
a ft£* /The sequences of the unique conformers from exQns 7, 8, 
Jj** % lO^and 11 of DP2.5 revealed dramatic mutations in the 
DP2.5 gene. The sequence of the new mutation creating the 
exon 7 conformer in patient 3746 was shown to contain a 
deletion of two adjacent nucleotides, at positions 730 and 
731 in the cDNA sequence (FIG. 7, SEQ ID NO:l). The 
normal sequence at this splice junction is CAQGGTCA 
(intronic sequence underlined), with the intron-exon bound- 
ary between the two repetitions of AG. The mutant allele in 
mis patient has the sequence CAGGTCA. Although this 
change is at the 5' splice site, comparison with known 
consensus sequences of splice junctions would suggest that 
a functional splice junction is maintained. If mis new splice 
junction were functional, the mutation would introduce a 
frameshift that creates a stop codon 15 nucleotides down- 
stream. If the new splice junction were not functional, 
messenger processing would be significantly altered] 

To confirm the 2-base deletion, the PCR product from 
FAP patient 3746 and a control DNA were electrophoresed 
on an acrylamide-urea denaturing gel, along with the prod- ^ 
ucts of a sequencing reaction. The sample from patient 3746 
showed two bands differing in size by 2 nucleotides, with the 
larger band identical in mobility to the control sample; this 
result was independent confirmation that patient, 3746 is 
heterozygous for a 2 bp deletion. 
/v> fjhe unique conformer found in exon 8 of patient 3460 
Jk^^t 1 was found to carry a C-T transition, at position 904 in the 
cDNA sequence of DP2.5 (shown in FIG. 7), which replaced 
the normal sequence of CGA with TGA. This point 
mutation, when read in frame, results in a stop codon 30 
replacing the normal arginine codon. This single-base 
change had occurred within the context of a CG dimer, a 
potential hot spot for mutation (Barker et at, 1984^} 

The conformer unique to FAP patient 3827 in exon 10 was 
found to contain a deletion of one nucleotide (1367, 1368, or 35 
1369) when compared to the normal sequence found in the 
other bands on the SSCP geL This deletion, occ urring w ithin 
a set of three T's, changed the sequence from Cl l iCA to 
CTTCA; mis 1 base frameshift creates a downstream stop 
within 30 bases. The PGR product amplified from mis 40 
patient's DNA also was electrophoresed on an acrylamide- 
urea denaturing gel, along with the PCR product from a 
control DNA and products from a sequencing reaction. The 
patient's PCR product showed two bands differing by 1 bp 
in length, with the larger identical in mobility to the PCR 45 
product from the normal DNA; this result confirmed the 
presence of a 1 bp deletion in patient 3827. 

Sequence analysis of the variant conformer of exon 11 
from patient 3712 revealed the substitution of a T by a G at 
position changing the nonrial tyrosine codon to a stop codon. 50 

The pair of conformers observed in exon 15 of the DP2.5 
gene for FAP patient 3751 also was sequenced. These 
conformers were found to carry a nucleotide substitution of 
C to G at position 5253 , the third base of a valine codon. No 
amino add change resulted from this substitution, suggest- 
ing that mis conformer reflects a genetically silent polymor- 
phism. 

The observation of distinct inactivating mutations in the 
DP23 gene in four unrelated patients strongly suggested 
mat DP2.5 is the gene involved in FAP. These mutations are 60 
summarized in Table UA. 

EXAMPLE 10 

This example demonstrates that the mutations identified 
in the DP2.5 (APQ gene segregate with the FAP phenotype. 65 

Patient 3746, described above as carrying an APC allele 
with a frameshift mutation, is an affected offspring of two 
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normal parents. Colonoscopy revealed no polyps in either 
parent nor among the patient's three siblings. 

FpNA samples from both parents, from the patient's wife, 
and from their three children were examined. SSCP analysis 

5 of DNA from both of the patient's parents displayed the 
normal pattern of conformers for exon 7, as did DNA from 
the patients's wife and one of his off-spring. The two other 
children, however, displayed the same new conformers as 
their affected father. Testing of the patient and his parents 

10 with highly polymorphic VNTR (variable number of tandem 
repeat) markers showed a 99.98% likelihood that they are 
his biological parents^ 

These observations confirmed that this novel conformed 
known to reflect a 2 bp deletion mutation in the DP2.5 gene, 

15 appeared spontaneously with FAP in this pedigree and was 
transmitted to two of the children of the affected individual 



■ This example demonstrates polymorphisms in the APC 
gene which appear to be unrelated to disease (FAP). 

Sequencing of variant conformers found among controls 
as well as individuals with APC has revealed the following 
polymorphisms in the APC gene: first, in exon 11, at position 

25 1458, a substitution of T to C creating an Rsal restriction site 
but no amino acid change; and second, in exon 15, at 
positions 5037 and 5271, substitutions of A to G and G to T, 
respectively, neither resulting in amino acid substitutions. 
These nucleotide polymorphisms in the APC gene sequence 

30 may be useful for diagnostic purposes. 



This example shows the structure of the APC gene. 
The structure of the APC gene is schematically shown in 
35 FIG. 8, with flanking intron sequences indicated (SEQ ID 
NO:ll-38). 

iS7 5he continuity of the very large (65 kb), most 3' exon in 
' ' DP2i was shown in two ways. First, inverse PCR with 
primers spuming the entire length of mis exon revealed no 
40 divergence of the cDNA sequence from the genomic 
sequence. Second, PCR amplification with converging prim- 
ers placed at intervals along the exon generated products of 
the same size whether amplified from the originally isolated 
cDNA, cDNA from various tissues, or genomic template. 
45 Two forms of exon 9 were found in DP2.5: one is the 
complete exon; and the other, labeled exon 9A, is the result 
of a splice into the interior of the exon that deletes bases 934 
to 1236 in the mRNA and removes 101 amino acids from the 
predicted protein (see FIG. 3, SEQ ID NO:l and 2)J 



This example demonstrates the mapping of the FAP 
deletions with respect to the APC exons. 

5< Somatic cell hybrids carrying the segregated chromo- 
somes 5 from the 100 kb (HHW1291) and 260 Id) 
(HHW1155) deletion patients were used to determine the 
distribution of the APC genes exons across the deletions. 
DNAs from these cell lines were used as template, along 

60 with genomic DNA from a normal control, for PCR-based 
amplification of the APC exons. 

PCR analysis of the hybrids from the 260 kb deletion of 
patient 3214 showed that all but one (exon 1) of the APC 
exons are removed by this deletion. PCR analysis of the 

65 somatic cell hybrid HHW1291, carrying the chromosome 5 
homolog with the 100 kb deletion from patient 3824, 
revealed that exons 1 through 9 are present but exons 10 
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EXAMPLE 13 
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through 15 are missing. This result placed the deletion 
breakpoint either between exons 9 and 10 or within exon 10. 

EXAMPLE 14 

This example demonstrates the expression of alternately 
spliced APC messenger in normal tissues and in cancer cell 
lines. 

Tissues that express the APC gene were identified by PCR 
amplication of cDNAmade to nnWAw^ io 
within adjacent APC exons. In addition, PCR primers mat 
flank the alternatively spliced exon 9 were chosen so that the 
expression pattern of both splice forms could be assessed 
All tissue types tested (brain, lung, aorta, spleen, heart, 
kidney, liver, stomach, placenta, and colonic mucosa) and 15 
cultured cell lines (lymphoblasts, HL60, and 
choriocarcinoma) expressed both splice forms of the APC 
gene. We note, however, that expression by lymphocytes 
normally residing in some tissues, including colon, prevents 
unequivocal assessment of expression. The large mRNA, ^ 
containing the complete exon 9 rather man only exon 9A, 
appears to be the more abundant message. 

Northern analysis of poly(A)-selected RNA from lym- 
phoblasts revealed a single band of approximately 10 kb, 
consistent with the size of the sequenced cDNA. 25 

EXAMPLE 15 

This example discusses structural features of the APC 
protein predicted from the sequence, ^ 
kl3£fhe cDNA consensus sequence of APC predicts mat the 
longer, more abundant form of the message codes for a 2842 
or 2844 amino acid peptide with a mass of 311.8 kd. This 
predicted APC peptide was compared with the current data 
bases of protein and DNA sequences using both Inteffige- 35 
netics and GCG software packages. No genes with a high 
degree of amino arid sequence similarity were found. 
Although many short (approximately 20 amino arid) regions 
of sequence similarity were uncovered, none was suffi- 
ciently strong to reveal which, if any, might represent ^ 
functional homology. Interestingly, multiple similarities to 
myosins and keratins did appear. The APC gene also was 
scanned for sequence motifs of known function; although 
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multiple glycosylation, phosphorylation, and rnyristoylation 
sites were seen, their significance is uncertain^ 



Analysis of the APC peptide sequence did identify fea- 
tures important in considering potential protein structure. 
Hydropathy plots (Kyte and Doolittle, J. Mol. BioL Vol 157, 
pp. 105-132 (1982)) indicate that the APC protein is notably 
hydrophilic. No hydrophobic domains suggesting a signal 

io peptide or a membrane-spanning domain were found. 
Analysis of the first 1000 residues indicates that ct-helical 
rods may form (Cohen and Parry, Trends Biochem, ScL Vol 
77. pp. 245-248 (1986); there is a scarcity of proline 
residues and, there are a number of regions containing 

15 heptad repeats (apolar-X-X-apolar-X-X-X). Interestingly, in 
exon 9A, the deleted form of exon 9, two heptad repeat 
regions are reconnected in the proper heptad repeat frame, 
deleting the intervening peptide region. After the first 1000 

2Q residues, the high proline content of the remainder of the 
peptide suggests a compact rather than a rod-like structure. 
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40 



The most prominent feature of the second 1000 residues 
is a 20 amino add repeat that is iterated seven times with 
semiregular spacing CTable 4). The intervening sequences 
between the seven repeat regions contained 114, 116, 151, 
205, 107, and 58 amino acids, respectively. Finally, residues 
2200-24000 contain a 200 amino acid basic domain. 

{jTABLE IV 

Seven Different Versions of the 20-Ammo Acid Repeat 

Consensus: F*VE*TP*CFSR*SSLSSLS 

1262: YCVEDTPICFSRCSSLSSLS 

1376: HTVQETPLMFSRCTSVSSLD 

1492: FATESTPDGFSCSSSLSALS 

1643: YC VEGTPINF STATSLSDLT 

1848: TPIEGTPYCFSRNDSLSSLD 

1953: FAIENTPVCPSHNSSLSSLS 

2013: RHVEDTPVCFSRNSSLSSLS 



Numbers denote me first amino acid of each repeat The consensus sequence 
at me top reflects a majority amino acid at a given positkxLj 



